Doc. Ref. FP7 
Appl.No. 10/565,885 

PCX WORLD INTELLECTUAL PROPERTY ORGANIZATION ^BSHEIF 

International Bureau ^"H 1 ^ 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 7 : 

C12N 15786, 7/01, 5/10, A61K 39/00, 
39/145, 48/00 



Al 



(11) International Publication Number: WO 00/53786 

(43) International Publication Date: 1 4 September 2000 (14.09.00) 



(21) International Application Number: PCT/EP00/01903 

(22) International Filing Date: 3 March 2000 (03.03.00) 



(30) Priority Data: 
99104519.6 



6 March 1999 (06.03.99) 



EP 



(71) Applicant (for all designated States except US): ARTEMIS 

PHARMACEUTICALS GMBH [DE/DE]; Neurather Ring 
1, D-51063 KGln (DE). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): HOBOM, Gerd [DE/DE]; 
Amdtstrasse 14, D-35392 Giessen (DE). FLICK, Ramon 
[DE/SE]; Lilla Hallsattravagen 28, S-137 93 Vasterhaninge 
(SE). MENKE, Anette [DE/DE]; Ockershauser Allee 7, 
D-35037 Marburg (DE). AZZEH, Maysa [IUDE]; Al- 
tenfeldsweg 32, D-35394 Giessen (DE). 

(74) Agents: HELBING, Jdrg et al.; Von Kreisler Selting Werner, 
Deichmannhaus am Dom, D-50667 Koln (DE). 



(81) Designated States: AE, AL, AM, AT, AU, AZ, BA, BB, BG, 
BR, BY, CA, CH, CN, CR, CU, CZ, DE, DK, DM, EE, 
ES, FI, GB, GD, GE, GH, GM, HR, HU, ID, IL, IN, IS, JP, 
KE, KG, KP, KR, KZ, LC, LK, LR, LS, LT, LU, LV, MA, 
MD, MG, MK, MN, MW, MX, NO, NZ, PL, PT, RO, RU, 
SD, SE, SG, SI, SK, SL, TJ, TM, TR, TT, TZ, UA, UG, 
US, UZ, VN, YU, ZA, ZW, ARIPO patent (GH, GM, KE, 
LS, MW, SD, SL, SZ, TZ, UG, ZW), Eurasian patent (AM, 
AZ, BY, KG, KZ, MD, RU, TJ, TM), European patent (AT, 
BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, 
MC, NL, PT, SE), OAPI patent (BF, BJ, CF, CG, CI, CM, 
GA, GN, GW, ML, MR, NE, SN, TD, TG). 



Published 

With international search report. 
Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



(54) Title: STABLE RECOMBINANT INFLUENZA VIRUSES FREE OF HELPER VIRUSES 
(57) Abstract 

The invention relates to a recombinant influenza virus for high-yield expression of incorporated foreign gene(s), which is genetically 
stable in the absence of any helper virus and which comprises at least one viral RNA segment being an ambisense RNA molecule (ambisense 
RNA segment) and containing one of the standard viral genes in sense orientation and a foreign, recombinant gene in anti-sense orientation 
or vice versa, in overall convergent arrangement. The present invention further provides a method for the production of said recombinant 
influenza virus, a method for constructing specific ribozyme-sensitive influenza carrier strains; pharmaceutical compositions comprising 
said recombinant influenza virus; and the use of said recombinant influenza virus for preparing medicaments for vaccination purposes 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


FI 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


SZ 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


UZ 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


ZW 


Zimbabwe 


a 


Cote d'lvoire 


KP 


Democratic People's 


NZ 


New Zealand 




CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






cu 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






cz 


Czech Republic 


IX 


Saint Lucia 


RU 


Russian Federation 






DE 


Germany 


LI 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







WO 00/53786 



1 



PCT/EP00/01903 



Stable Recombinant Influenza Viruses Free of Helper Viruses 
Field of the Invention 

The invention relates to a recombinant influenza virus for high-yield 
expression of incorporated foreign gene(s), which is genetically stable in 
the absence of any helper virus; a method for the production of said 
recombinant influenza virus, a method for constructing specific ribozyme- 
sensitive influenza carrier strains; pharmaceutical compositions comprising 
said recombinant influenza viruses; and the use of said recombinant 
influenza virus for preparing medicaments for vaccination purposes. 

Technical Background 

Redesigning influenza virus into a vector system for expression of foreign 
genes similar to what has been achieved in several other thoroughly 
studied viruses such as adenovirus, retrovirus, Semliki Forest virus or 
Rabies virus has the advantage of an industrially well established mode of 
cheap propagation for influenza in fertilized chicken eggs leading to rather 
high titers (above 10 10 /ml). On the other hand none of its genes may be 
deleted from the influenza genome according to our present knowledge, 
and give room for large-size foreign insertions. Only small fragments of 
foreign polypeptide chains such as B cell epitopes (10 to 15 amino acids) 
may be inserted into selected positions within two of the viral proteins, i.e. 
in exchange for one of the variable antigenic regions located at the surface 
of hemagglutinin (Muster et al., Mucosal model of immunization against 
human immunodeficiency virus type 1 with a chimeric influenza virus. J. 
Virol. 69 (11), 6678-6686 (1995)), or into the stalk sequence of viral 
neuraminidase (Garcia-Sastre and Palese, The cytoplasmic tail of the 
neuraminidase protein of influenza A virus does not play an important role 
in the packaging of this protein into viral envelopes. Virus Res. 37, 37-47 
(1995)), and be stably maintained as functional fusion proteins. 
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Constructs of this kind turned out to be useful for experimental 
vaccination in a few cases studied, but only rather few clearly defined 
epitope sequences (of ten to twelve amino acids each) are known today, 
and some of them might also be misfolded within such restricted fusion 
5 protein positions, or in other cases interfere with the correct tertiary 
structure and function of their host polypeptide chains. 
Incorporation of a full-size foreign protein into influenza via reverse 
genetics, encoded by an independent ninth vRNA molecule in addition to 
its regular set of eight standard vRNA segments is without special 

10 provisions only transiently possible (Luytjes et al., Amplification, 
expression, and packaging of a foreign gene by influenza virus. Cell 59, 
1107-1113 (1989); Enami et al., An influenza virus containing nine 
different RNA segments. Virology 185, 291-298 (1991)). In the absence of 
a continuous selective pressure any additional recombinant vRNA segment 

15 cannot be stably maintained as long as a wildtype promoter sequence is 
used on that ninth vRNA segment, and it will inadvertently be lost after 
few steps of propagation. 

Using a different system of influenza reverse genetics developed in our 
laboratory (Zobel et al., RNA polymerase I catalysed transcription of insert 

20 viral cDNA. Nucleic Acids Res. 21, 3607-3614 (1993); Neumann et al., 
RNA polymerase I-mediated expression of influenza viral RNA molecules. 
Virology 202, 477-479 (1994)), which was built around in vivo synthesis 
of recombinant vRNA molecules by cellular RNA polymerase I transcription 
of the respective template cDNA constructs, promoter-up mutations have 

25 been designed by nucleotide substitutions (Neumann and Hobom, 
Mutational analysis of influenza virus promoter elements in vivo. J. Gen. 
Virol. 76, 1709-1717 (1995)). When these are attached to a recombinant 
ninth vRNA segment its increased transcription and amplification rate will 
not only compensate for the losses suffered spontaneously, but even 

30 cause accumulation of the foreign vRNA segment during simple viral 
passaging, in the absence of any selection. However, due to its over- 
replication relative to all of the regular influenza vRNA segments (which of 
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course are connected to wild-type promoter sequences) after catching up 
with the others the foreign segment will become over-abundant. This 
increasingly will result in viral particles that have incorporated several 
copies of recombinant vRNA, but no longer have a full set of all eight 
5 standard segments incorporated among an average of about 15 vRNA 
molecules present within a virion. Such particles are defective and will not 
result in plaque formation, hence after an initial increase of recombinant 
viral particles during the first steps of propagation a dramatic decrease is 
observed, usually at the third or fourth step of viral passaging, depending 

10 on the size of the recombinant vRNA and the level of the promoter-up 
mutation attached. A balanced situation with regard to the insert length 
and the level of promoter activity can be achieved, and has been 
propagated in a particular case over 11 passages, with essentially stable 
levels of recombinant viruses among a majority of helper viruses (around 

15 80%) during these steps. If a full-level promoter-up mutation is used 
(1104 or the new variant 1920, see below) a balanced-level propagation is 
reached in conjunction with a recombinant vRNA size of 4000 nucleotides 
(Maysa Azzeh, Ph.D. Thesis, Univ. Giessen (2000)). 

In all of these preparations, both in transiently achieved increased yields 
20 (up to 40% of recombinants after three or four steps of viral passage), 
and in a balanced propagation of recombinant influenza viruses (10 - 
20%) the respective viral progeny inadvertantly constitute mixtures with a 
majority of non-recombinant helper viruses. These result both from a 
statistical mode of packaging vRNA molecules into a virion (the ninth 
25 segment may not be co-packaged), and from the fraction of cells solely 
infected by helper virus. 

The problems of fractional yields and of instability during viral propagation 
of recombinant influenza are the problems to be solved with the present 
invention. 

30 
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Summary of the Invention 

Starting out from two observations in this laboratory which are discussed 
below and which concern two hitherto unsuspected properties of influenza 
viral RNA polymerase in its interaction with terminally adapted influenza- 
5 specific RNA molecules, a new technique for the construction of stable 
recombinant influenza viruses was found. 

As previously described in WO 96/10641 plasmid constructs are designed 
to generate influenza vRNA-like molecules in vivo by cellular RNA 
polymerase I transcription following plasmid DNA transfection into tissue 

10 culture cells, and to this end contain flanking rDNA promoter and 
terminator elements, externally located relative to any cDNA constructs in 
between. The resulting recombinant vRNA molecules are designed to 
contain 5' and 3' recognition sequences for influenza viral RNA 
polymerase, which however carry up to five nucleotide substitutions (in 

15 promoter-up mutant pHL1920) resulting in a substantial increase of 
expression over wildtype influenza promoter levels. While recombinant 
pseudoviral RNA is Initially transcribed by RNA polymerase I, further 
amplification and mRNA transcription depends on the presence of viral 
RNA polymerase and viral nucleoprotein in the cell, which generally are 

20 provided by infection of a helper virus. As a consequence the progeny viral 
yield will constitute a mixture of recombinant viruses together with a 
majority of wild-type helper viruses. 

In the new technique the recombinant vRNA-like molecules as transcribed 
by RNA polymerase I are constructed as ambisense RNA double segments, 

25 with one reading frame (an influenza gene) in sense and a second one (a 
foreign gene) in anti-sense orientation, or vice versa. In such constructs 
both reading frames will be expressed via the cap-snatching mode, even if 
at different levels. Again, infection by helper virus is required to provide 
the necessary viral early and late proteins for genetic expression and 

30 virion packaging. However, the particular helper virus used in the new 
method is a recombinant virus carrying 2x2 specifically designed ribozyme 
targets inserted into the flanking non-coding sequences of one of its eight 
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vRNA segments. The viral segment chosen to become ribozyme-sensitive 
is always identical to the one used in constructing the recombinant 
ambisense RNA molecule, as the viral carrier gene in covalent linkage with 
an additional foreign gene. 
5 Recombinant influenza viruses produced in this way through RNA 
polymerase I transcription of an ambisense viral RNA molecule followed by 
infection with that specifically designed type of ribozyme-sensitive helper 
virus will carry one of the influenza genes twice, once within that 
ribozyme-sensitive helper vRNA segment, and a second time within the 

10 ribozyme-resistant ambisense segment. Recombinant viruses of this type 
are again obtained initially only as a mixture together with a majority of 
non-recombinant helper viruses. A progeny viral passage through tissue 
culture cells (293T) which before have been transiently transfected with 
plasmid constructs expressing the respective double-headed ribozyme will 

15 (in one step) inactivate the ribozyme-sensitive segment by a factor of up 
to 100. One or two rounds of such ribozyme treatment in vivo will at the 
same time (a) purify the recombinant virus from its non-recombinant 
helper contaminants, and (b) delete the sensitive vRNA helper segment 
from within the initial (additive) recombinant virus. 

20 As a result recombinant influenza viruses are isolated along this several- 
step procedure, which are free of contaminating helper viruses and carry 
seven regular and one ambisense vRNA segments, all in a balanced 
replication mode. Their recombinant nature is stably maintained because 
of a covalent junction between one of the viral genes and the full-size 

25 foreign gene inserted, a situation achieved here for the first time, via 
constructing an influenza ambisense RNA segment. The whole procedure 
is independent of any (selectable) phenotype, and can be applied to either 
of the eight influenza vRNA segments. After establishing a first ambisense 
vRNA segment carrying a single foreign gene it can also be repeated all 

30 over for inserting a second foreign gene within another ambisense RNA 
segment of the same constitution in principle. 
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Stable recombinant viruses of the type described can be used for cheap 
propagation in fertilized eggs, either for production of those recombinant 
viruses themselves or for production of foreign proteins or glycoproteins 
encoded by them, and hence find application in (glyco)protein production 
5 or in providing vector systems for somatic gene therapy or in being used 
as vaccination agents. 

Thus, the present invention provides 

(1) a recombinant influenza virus for high-yield expression of incorporated 
10 foreign gene(s), which is genetically stable in the absence of any helper 

virus and which comprises at least one viral RNA segment being an 
ambisense RNA molecule (hereinafter "ambisense RNA segment") and 
containing one of the standard viral genes in sense orientation and a 
foreign, recombinant gene in anti-sense orientation, or Wee versa, in 
15 overall convergent arrangement; 

(2) a preferred embodiment of the recombinant influenza virus defined in 
(1) above, wherein one or more of the regular viral RNA segments, 
differing from said at least one ambisense RNA segment (hereinafter 
"modified regular segment"), comprises a vRNA encoding a foreign gene, 

20 preferably one or more of the regular viral RNA segments has (have) been 
exchanged for a vRNA encoding a foreign gene; 

(3) a preferred embodiment of the recombinant influenza virus defined in 
(1) and (2) above, in which the terminal viral RNA sequences of said one 
or more modified regular segments and/or of said at least one ambisense 

25 RNA segment, which are active as the promoter signal, have been 
modified by nucleotide substitutions in up to five positions, resulting in 
improved transcription rates of both the vRNA promoter as well as the 
cRNA promoter as present in the complementary sequence; 

(4) a method for the production of recombinant influenza viruses as 
30 defined in (1) to (3) above comprising 

(a) RNA polymerase I synthesis of recombinant vRNAs in vivo, in 
antisense, sense or ambisense design, 
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(b) followed by infection with an influenza carrier strain constructed to 
include flanking ribozyme target sequences in at least one of its viral RNA 
segments, and 

(c) thereafter selective vRNA inactivation through ribozyme cleavage; 

5 (5) a method of constructing influenza carrier strains carrying one or more 
ribozyme target sites (of type one) in vRNA flanking positions comprising 

(a) RNA polymerase I synthesis of recombinant vRNAs in vivo, carrying 
two different 3' promoter sequences in tandem, which are separated by a 
second type of ribozyme target sequence, and which carry the said 

10 internal ribozyme target sites of type one; 

(b) followed by infection of an influenza wildtype strain; 

(c) thereafter amplification through simple steps of viral propagation; and 

(d) finally isolation through removal of their external 3' promoter 
sequence by ribozyme cleavage through infection of cells expressing 

15 ribozyme type 2, followed by plaque purification, said method being 
suitable for the construction of an influenza carrier strain required for step 
(b) of (4) above; 

(6) a ribozyme-sensitive influenza carrier (helper) strain obtainable by the 
method of (5) above; 
20 (7) a pharmaceutical composition comprising a recombinant influenza 
virus as defined in (1) to (3) above; 

(8) the use of a recombinant influenza virus as defined in (1) to (3) above 
for preparing a medicament for vaccination purposes; 

(9) the use of a recombinant influenza virus as defined in (1) to (3) above 
25 for preparing agents for somatic gene therapy; 

(10) the use of a recombinant influenza virus as defined in (1) to (3) 
above for preparing agents for transfer and expression of foreign genes 
into cells (abortively) infected by such viruses; 

(11) the use of a recombinant influenza virus as defined in (1) to (3) 
30 above for preparing agents for transfer and expression of RNA molecules 

into cells infected by such viruses; 
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(12) a method for the production of proteins or glycoproteins which 
comprises utilizing a recombinant influenza virus as defined in (1) to (4) 
above as expression vector; 

(13) a method for preventing and/or treating influenza which comprises 
5 administering a recombinant influenza virus as defined in (1) to (3) above 

to the mammal to be treated, i.e., a vaccination method utilizing said 
recombinant virus; 

(14) a method for somatic gene therapy, which method comprises 
subjecting the organism to be treated with a recombinant influenza virus 

10 as defined in (1) to (3) above; 

(15) a method for transfer and expression of foreign genes into cells, and 
for transfer and expression of RNA molecules into cells, which method 
comprises infecting the cells with a recombinant influenza virus as defined 
in (1) to (3) above; 

15 (16) use of a recombinant influenza virus as defined in (1) to (3) above 
for preparing agents for autologous immunotherapy; 

(17) a method for an immunotherapy which comprises ex vivo infection of 
immune cells with a recombinant influenza virus as defined in (1) to (3) 
above, and introduction of the transduced cells into the patient; and 
20 (18) a method for the induction of antibodies which comprises utilizing, a 
recombinant influenza virus as defined in (1) to (3) above as an 
immunogen. 

Brief Description of the Figures 

25 Fig.l shows 3' nucleotide extensions of influenza vRNA template 
molecules. 

Fig. 2 shows propagation of recombinant influenza viruses with tandem 
bicistronic vRNA. 

Fig. 3 shows tandem bicistronic vRNA supporting an alternative mode of 
30 transcription and replication initiation 

Fig. 4 shows the outgrowth of promoter-up recombinant vRNA versus 
wildtype vRNA segments in stepwise propagation of influenza virus. 
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Fig. 5 shows pAM403 ribozyme cleavage of pHL2969 derived vRNA 
molecules at specific target sites inserted between an external and an 
internal 3" promoter sequence. 

Fig. 6 shows comparative cleavage analysis of model CAT vRNAs with 
5 tandem target sites in various flanking positions, by target-specific 
ribozymes. 

Fig. 7 shows an alignment of pAM424 double-headed ribozyme with one of 
their repetitive target sequences located within the 5' and 3' vRNA flanking 
regions. 

10 Fig. 8 shows pAM424 ribozyme cleavage of resistant FPV wild-type HA 
vRNA and ribozyme-sensitive pHL2969-derived HA-vRNA in 293T cells. 
Fig. 9 shows a functional analysis of the influenza cRNA promoter 
structure. 

Fig. 10 shows a functional analysis of the vRNA and cRNA promoter in 
15 ambisense arrangement. 

Fig. 11 shows a basepair substitution according to the vRNA 'corkscrew' 
structure. 

Fig. 12 shows a flow-chart of the isolation procedure for an ambisense 

recombinant influenza virus. 
20 Fig. 13 shows immuno-electron microscopy of purified influenza 

FPV/CSFV-E2-HA virions. 

Fig. 14 shows vector pHL2969. 

Fig. 15 shows vector pAM403. 

Fig. 16 shows vector pAM424. 
25 Fig. 17 shows vector pHL2507. 

Fig. 18 shows vector pHL2583. 

Fig. 19 shows vector pHL2989. 

Fig. 20 shows vector pHL1920. 

30 Detailed Description of the Invention 

According to the present invention "influenza virus" embraces influenza A 
virus, influenza B virus and influenza C virus, with influenza A virus being 
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preferred. A "mammal" according to the present invention includes 
humans and animals. "Organism" embraces prokariotic and eukariotic 
systems as well as multicellular systems such as vertebrates (including 
mammals) and invertebrates. "Infected cells" and "infecting cells" 
according to the present invention also include "abortively infected cells" 
and "abortively infecting cells", respectively. 

In a preferred influenza virus according to embodiment (1) at least one of 
the regular viral RNA segments is replaced by an ambisense RNA segment 
which contains one of the standard viral genes in sense orientation and a 
foreign, recombinant gene in anti-sense orientation or vice versa in overall 
convergent arrangement. It is moreover preferred that in the ambisense 
RNA molecule said foreign recombinant gene is covalently bound to one of 
the viral genes while the original vRNA segment coding for the same gene 
is deleted from the recombinant virus by a specific ribozyme cleavage. 

The foreign gene(s) in ambisense covalent junction with the viral gene(s) 
preferably code for proteins and/or glycoproteins which are secreted from 
cells infected with the recombinant virus, such as lymphokines, or code for 
glycoproteins that are incorporated into the virion as well as the plasma 
membrane of the infected cell. In another preferred embodiment the 
foreign gene(s) in ambisense covalent junction with the viral gene(s) code 
for proteins or artificial polypeptides designed to support an efficient 
presentation of inherent epitopes at the surface of infected cells, for 
stimulation of B cell and/or T cell response. Such proteins or artificial 
polypeptides constitute for instance a tumor antigen or an artificial 
oligomeric series of T cell epitopes. Finally, the foreign gene(s) may be 
suitable for transfer and expression of RNA molecules, including antisense 
RNAs and double stranded RNAs, into cells. Such recombinant influenza 
viruses are suitable for sequence specific gene silencing, for example by 
antisense or RNA interference mechanisms. 
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A preferred recombinant virus of embodiment (2) of the invention is where 
in the regular viral RNA segments one or both of the standard 
glycoproteins hemagglutinin and neuraminidase have been exchanged, 
preferably into foreign glycoprotein(s) or into fusion glycoproteins 
5 consisting of an anchor segment derived from hemagglutinin and an 
ectodomain obtained from the foreign source, viral or cellular, or in which 
such recombinant glycoprotein has been inserted as a third molecular 
species in addition to the remaining standard components. 

10 As set forth in embodiment (3) above, a preferred recombinant virus of 
the invention is where the terminal viral RNA sequences, which are active 
as promoter signal, have been modified by nucleotide substitution in up to 
5 positions, resulting in improved transcription rates (of both the vRNA 
promoter and in the cRNA promoter as present in the complentary 

15 sequence) as well as enhanced replication and/or expression rates relative 
to the wild-type sequence. Said modified terminal viral RNA sequences 
differ from the wild-type sequence in that they are containing at least one 
segment (a naturally occurring segment or an additional segment) 
wherein the 12 nucleotide conserved influenza 3' terminal sequence has 

20 been modified by replacement of one to three nucleotides occurring in said 
sequence at positions 3, 5 and 8 relative to the 3' end by other 
nucleotides provided that the nucleotides introduced in positions 3 and 8 
are forming a base pair (i.e., if the nucleotide position 3 is G, than that in 
position 8 is C; if the nucleotide in position 3 is C, than that in position 8 is 

25 G; etc.). 

The 3' conserved regions of the wild-type influenza virus have the 
following sequences: 
Influenza A: 5'-CCUGCUUUUGCU-3' 
30 Influenza B: 5'-NN(C/U)GCUUCUGCU-3' 
Influenza C: 5'-CCUGCUUCUGCU-3'. 
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Moreover, the 13 nucleotide conserved influenza 5'-termina! sequence 
may be modified by replacement of one or two nucleotides occurring in 
said sequence as positions 3 and 8 by other nucleotides, again provided 
that the introduced nucleotides are forming a base pair. The 5' conserved 
5 regions of the wild-type influenza virus have the following sequences: 
Influenza A: 5'-AGUAGAAACAAGG 
Influenza B: 5'-AGUAG(A/U)AACA(A/G)NN 
Influenza C: 5'-AGCAGUAGCAAG(G/A) : 



10 Preferred influenza viruses of the invention are those wherein in the 3' 
conserved region the replacements G3A and C8U have been performed, 
more preferred are those where also the replacement U5C has been 
performed (the above mutations are relative to the 3' end; such counting 
from the 3' end is also indicated by a line on top of the digit, e.g., G 3A). 

15 Another preferred influenza virus mutant comprises the 3'-terminal 
nucleotide sequence G3C, U5C and C8G (relative to the 3' end) giving the 
following 3' terminal nucleotide sequence S'-CCUCGUUCUCCU-S'. Among 
the influenza viruses defined hereinbefore those having a 3'-terminal 
nucleotide sequence of 5'-CCUGUUUCUACU-3' are most preferred. In case 

20 of an influenza A virus the segment may further have the modifications 
U3A and A8U in its 5' terminal sequence, in case of influenza C it may 
have the modifications C3U and G8A in its 5' terminal sequence. The most 
preferred influenza viruses of the present invention comprise the following 
general structures: 

25 Influenza A (mutant DHL 1104): 

5'-AGUAGAAACAAGGNNNU 5 -6-. (880-2300 ntds)..N , N'N'CCUCUUUCuACU-3' 

Influenza A (mutant DHL 1920): 

S'-AGMGAAUCAAGGNNNU-h,.. (880-2300 ntds)..N'N , N , CCUCUUUCuACU-3' 

Influenza A (mutant DHL 1948): 

30 5'-AGUAGAAACAAGGNNNUs-6.. (880-2300 ntds)..N'N , N , CCUCGUUCuCCU-3' 
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Influenza B: 

5'-AGUAG(A/U)AACA(A/G)NNNNNU 5 -6..(880-2300 ntds)..N , N , N , N , N , (C/U)GUUUCuACU-3' 

Influenza C: 

5 , -AGUAGUAACAAG(G/A)GU5-6.. (880-2300 ntds)..CCCCUGUUuCuACU-3' 

5 

In the above structures the variables are defined as follows: 

(1) Underlined and enlarged letters show the required mutations relative 

to the wild-type sequence for preparing a promoter mutant with enhanced 

properties; 

10 (2) enlarged A in the 5'-part of the sequence: additional A (position 10), 
angle-forming; 

(3) (A/G) at one position: different isolates or single segments with 
different sequence at this respective positions which are analytically 
interchangeable; 

15 (4) N and N': undefined, but base paired positions relative to each other in 
complementarity between the 5' and 3' termini, different among the 8 
segments, but constant for each segment throughout all viral isolates; 
(5) (880-2300 ntds): the lengths of the virus segments, in case of 
segments with foreign genes increased up to 4,000 nucleotides. 

20 

The pharmaceutical composition according to embodiment (7) above and 
the medicament of embodiment (8) above contains the recombinant 
influenza virus in a pharmaceutically effective amount. Besides said 
recombinant influenza virus, the pharmaceutical composition and the 
25 medicament may contain further pharmaceutically acceptable carrier 
substances well-known to a person skilled in the art, such as binders, 
desintegrants, diluents, buffers, preservatives, etc. The pharmaceutical 
composition and medicaments are solid or liquid preparations and are 
suitable to be administered orally, intravenously or subcutaneously. 

30 

The medicament according to embodiment (8) above is preferably suitable 
as a medicament against influenza and/or against other infections. The 
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recombinant influenza virus may be present in form of inactivated 
preparations or may be present in form of live recombinant viruses, 
preferably as attenuated viruses. 

5 Live recombinant viral vaccines, live but attenuated recombinant viral 
vaccines or inactivated recombinant viral vaccine can be formulated. 
Inactivated vaccines are "dead" in the sense that their infectivity has been 
destroyed. Ideally, the infectivity is destroyed without affecting its 
immunogenicity. To prepare inactivated vaccines, the recombinant virus 

10 may be grown in cell cultures or in embryonated chicken eggs, purified, 
and inactivated by formaldehyde or B-propiolactone. The resulting vaccine 
is usually administered intramuscularly. 

Inactivated viruses may be formulated with a suitable adjuvant to 
15 enchance the immunological response. Such adjuvants may include, but 
are not limited to mineral gels, e.g., aluminum hydroxide, surface-active 
substances such as pluronic polyols, lysolecithin, peptides, oil emulsions, 
and potential useful human adjuvants such as BCG. 



20 Many methods may be used to introduce the vaccine formulations above, 
for example the oral, intradermal, intramuscular, intraperitoneal, 
subcutaneous, or intranasal routes. Where a live recombinant virus 
vaccine is used, it may be preferable to introduce the formulation via the 
natural route of infection for influenza virus. 

25 

The medicament according to embodiment (8) above is preferably suitable 
for prophylactic or therapeutic vaccination, or both, against influenza and 
other infections. For example, recombinant viruses can be made for use in 
vaccines against HIV, hepatitis B virus, hepatitis C virus, herpes viruses, 
30 papilloma viruses, to name but a few. In one embodiment the 
recombinant virus contains the genes for surface proteins of the viruses, 
in another the genes for non-structural or regulatory genes. The 
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recombinant viruses may be present in form of inactivated preparations or 
may be present in form of live recombinant viruses, or in live, but 
attenuated viruses. In an attenuated virus the recombinant virus would go 
through a single or very few propagation cycle(s) and induce a sufficient 
5 level of immune response, but would not cause disease. Such viruses lack 
one of the essential influenza genes or contain mutations to introduce 
temperature sensitivity. 



The agents of embodiments (9)-(ll) of the invention are applicable in ex 
10 vivo and in vivo application schemes. The RNA molecule to be expressed 
by means of the agent of the embodiment (11) is of an antisense 
sequence or double strand sequence (in ambisense bidirectional 
transcription) relative to a target cellular mRNA molecule. In embodiment 
(11) the agent is preferably suitable for sequence-specific gene silencing, 
15 preferably by antisense RNA or RNA interference mechanisms. 

The method for the production of proteins or glycoproteins is preferably 
performed in cell culture cells or in fertilized chicken cells in accordance 
with standard techniques within the general knowledge of a person skilled 
20 in the art. The proteins or glycoproteins to be expressed are those 
incorporated into the ambisense construct as defined hereinbefore. 

The methods according to embodiments (13) to (15), (17) and (18) of the 
invention include the administration of an effective amount to the mammal 
25 or the administration of a sufficient infective dose of the recombinant virus 
to the cell system that is used for ex vivo therapy or for in vitro 
investigations, whereby the amount and dose will be determined by a 
parson skilled in the respective arts or knowledgeable of the desired 
treatments. 

30 

The agent of embodiment (16) of the invention is preferably utilized to 
infect, transfect or transduce patient-derived immune cells. The agent is 
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suitable for treatment of cancer or chronic viral infections. For this 
purpose, patient derived immune cells, preferably dendritic cells, are ex 
vivo infected with recombinant influenza viruses expressing, e.g., tumor 
antigens or viral antigens. The transduced cells are then reintroduced into 
5 the patient. 

The preferred method for immunotherapy of embodiment (17) of the 
invention is an autologous immunotherapy, wherein the cells which are ex 
vivo infected are patient-derived and the transduced cells are reintroduced 
10 into the patient. The diseases to be treated by this method include cancer 
and chronic viral infections. For details regarding such treatment see 
discussion of embodiment (16) above. 

The method for inducing antibodies according to embodiment (18) of the 
15 invention is suitable for inducing antibodies to foreign proteins including 
glycoproteins, following the administration of protein or glycoprotein 
antigens as part of a recombinant influenza virus in an authentic 
conformation, whereby the virus is purified by gentle procedures based on 
hemagglutination, and the gene is expressed at high rates in the infected 
20 cells. 

As influenza viruses have a wide host range, recombinant influenza 
viruses can be used to obtain strong immune responses in, and isolate 
antibodies from, a wide range of animals, including, but not limited to, 

25 fowl, pigs, goats, horses, and mice. Further, influenza viruses adapted to 
the mouse can be used for the infection of mice, by several routes 
including the intranasal route. This results in infection of the pharyngeal 
mucosal cells and results in an additional type of B cell response (e.g., as 
recognized in the ratio of IgG to IgA). Mice are of particular utility in the 

30 induction of immune responses in transgenic mice that have been 
engineered to express human antibodies. As gentle procedures based on 
hemagglutination are used to purify influenza viruses, antibodies to 
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antigens in native conformation can be isolated from the infected 
mammals. 

Further preferred embodiments of the invention are set forth herein 
5 below. 

A. Construction of influenza helper virus strains carrying ribozvme 
target sequences in flanking positions within either of the vRNA 
segments 

10 

A.l: Influenza RNA polymerase will initiate transcription and replication 
from promoter structures located at internal positions in an RNA molecule, 
not only from the natural position at both ends of a vRNA molecule: 
This is true in particular for promoter-up variants in RNA-internal location 

15 due to their enhanced binding affinity for viral polymerase. Not only 3' end 
extensions are tolerated in RNA-internal promoter recognition (Fig. 1), but 
also 5' extensions as well as extensions at both ends of the RNA template 
molecule, containing noncomplementary as well as complementary 
sequence, i.e. potentially present as a double-stranded extension. Finally, 

20 also an extension by way of duplication of the promoter sequence (active 
or inactivated) leads to mRNA transcription and CAT expression, initiated 
from the active pair of 5' and 3' promoter halves, irrespective if in external 
or in internal or even in an oblique localization (active 5' promoter 
sequence In external, active 3' promoter sequence in internal position, or 

25 vice versa). RACE-determination of the resulting 5' and 3' ends of viral 
mRNA and cRNA, i.e. the products of transcription and replication 
reactions for several of the extended template vRNA constructs proves an 
exact recognition and sequence-specific initiation at a position equivalent 
to regular 3' position 1: all of the various template extensions are lost in 

30 every product RNA molecule, most likely after only one round of 
replication. 
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A.2: Bicistronic (tandem) vRNA molecules carrying an additional 
3' specific promoter sequence in a central position between its two genes 
can be used for an indirect selection method for recombinant influenza 
5 viruses: 

The method described is applicable for any foreign gene (e.g. CAT) 
without a selection potential of its own, if inserted into the distal mRNA 
position (proximal vRNA position, in anti-sense orientation) behind a 
carrier gene (e.g. GFP) in the proximal mRNA position, able to serve as a 

10 transient selection marker. The carrier gene which is used for selection will 
get lost spontaneously during further propagation. These constructs are 
equivalent to a 3' extension of the template vRNA by a full-size gene of 
750 nucleotides up to a second 3' promoter sequence, in terminal location. 
While in the set of experiments shown in Fig. 2 the external 3' promoter 

15 sequence is maintained throughout as the same promoter-up variant 
(1104), the internal 3' promoter sequence has been varied to include a 
full-level promoter-up variant (pHL2270, containing promoter mutant 
1104), a medium-level promoter variant (pHL2350, containing promoter 
mutant 1948), a wildtype promoter construct (pHL2629), and a construct 

20 carrying an unrelated central sequence in an otherwise identical design 
(PHL2300). 

Due to the presence of two 3' promoter sequences in conjunction with a 
single 5' promoter sequence an alternating interaction between them will 
constitute one or the other active promoter structure (see Fig. 3). While 

25 the external location with an adjacent RNA 3' end may have a structural 
advantage, this appears to be compensated by the shorter distance in an 
interaction between the 5' sequence and the central 3' sequence in 
constituting the internal promoter, such that the competition between the 
two primarily reflects the various internal 3' promoter allele sequences 

30 used, compare Fig. 2B and activity ratios indicated above and below the 
lanes. Translation of the mRNA-distal gene (CAT) is only observed 
following an internal initiation at the bicistronic vRNA template, resulting in 
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a spontaneous deletion of the mRNA-proximal (vRNA-distal) gene, GFP, 
compare right half of Fig. 3. In complementary analyses GFP fluorescence 
is observed initially for all of the bicistronic constructs, but gets lost on a 
faster rate from pHL2270 transfected and influenza infected cells (not 

5 shown), and will stay unchanged in pHL2300-treated cells. The indirect 
selection system based on bicistronic (tandem) molecules as designed 
here and demonstrated for reporter genes GFP and CAT can be used for 
any other gene without distinct phenotype upon insertion behind an 
unrelated carrier gene with properties useful in selection. - In employing 

10 that technique an initial phase of (repeated) positive selection for infected 
cells expressing that proximal trait (e.g. by FACS or by magneto-beads) 
will be followed by a second phase with negative selection, i.e. against 
that fraction of infected cells still exposing the same property. 

15 A.3: Isolation of an influenza strain designed to carry 2x2 flanking 
ribozyme target sequences at the 5' and 3' end of vRNA segment 4 coding 
for hemagglutinin: 

The above scheme for an indirect selection of any foreign recombinant 
gene behind a proximal carrier gene is further modified by deleting the 

20 carrier gene altogether. Instead, both 3' terminal promoter sequences 
(mutant and wildtype) follow each other at a short distance, separated 
only by a specific, repetitive ribozyme target sequence, - different from 
other target sequences to be described further below. The cDNA insert 
following after the second 3' promoter sequence consists mainly of a 

25 regular hemagglutinin (H7) coding sequence, however both the 5' and 3' 
vRNA terminal regions of the insert carry that other ribozyme target 
sequence (different from the first target sequence mentioned above) 
inserted in either location in tandem duplication (pHL2969, see Fig. 14). 
Due to a superior replication supported by the promoter-up variant located 

30 in 3' external position the recombinant HA segment attached to that 
promoter sequence will become enriched during the first steps of viral 
propagation, while the originally dominating HA segment of the helper 
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virus which is under control of a wildtype promoter sequence is 
consecutively reduced and finally is no longer detectable among viral 
progeny. This result is documented by RT-PCR analysis of consecutive viral 
populations as obtained in that stepwise propagation procedure, see Fig. 
5 4. 

In the next step the viral lysate is twice passaged via infection of cell 
culture cells (293T) that before have been DNA-transfected by plasmid 
pAM403 (Fig. 15). This construct has been specifically designed to express 
a hammerhead ribozyme with flanking sequences complementary to the 

10 repetitive GUC-containing target sequence, as present twice in between 
the external and internal 3' promoter signals in the recombinant HA vRNA 
segment, see Fig. 5. In this way the extra promoter sequence is cut off 
from the finally resulting recombinant HA vRNA. Its promoter-up activity 
was useful in achieving an initial increase in the concentration of 

15 recombinant HA vRNA over wildtype HA vRNA, and in finally excluding the 
latter from further propagation. However, for the same reason that high 
activity of the promoter variant will cause instability in the resulting viral 
progeny, and an effective 'substitution' at this time through ribozyme 
cleavage by the internally located promoter signal, wild-type or slightly 

20 improved, will restore stability to the progeny viruses, with all of their 
eight vRNA segments now brought back in balance to each other. Due to 
the ribozyme cleavage site at 26 nucleotides 3' of the wild-type promoter 
sequence (see Fig. 5), in the initial stage that promoter signal is situated 
in a vRNA-internal location, extended by a 3' terminal sequence of 26 

25 nucleotides. According to the data presented in Fig. 1 this should cause a 
transient slight reduction in activity, resulting however in one step in 
regular viral mRNAs and cRNAs, with any initially remaining extra 
sequence being lost from the finally resulting recombinant HA vRNA. 
Progeny viruses still carrying an external promoter-up sequence (before 

30 ribozyme treatment or due to incomplete reaction) will not cause any 
plaque, due to over-replication of one vRNA segment relative to all others 
which results in a high load of defective particles. However, progeny 
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viruses which have lost that external promoter element due to ribozyme 
cleavage will yield regular plaques due to a balanced mode of replication 
for all eight wild-type or recombinant vRNA segments. Hence plaque 
purification is used for isolating a pure influenza viral strain carrying 2x2 

5 ribozyme targets in its recombinant HA vRNA segment, with its termini 
reduced to the wild-type promoter sequence. The nature of the viral strain 
isolated has been confirmed in this regard by RT-PCR analysis, see Fig. 4. 
The above isolation procedure resulting in influenza viral strains carrying 
2x2 flanking ribozyme target sequences has been performed twice for the 

10 HA coding segment (segment 4) to obtain two different isolates with 
regard to the orientation of the ribozyme target sequences. In one of the 
isolates (vHM41, see SEQ. ID NO: 3) the tandem target sites have been 
inserted into the HA vRNA non-translated sequence both in 5' and 
3' location, while according to the second design that 5' tandem target 

15 sequence has been included in an inverted orientation, such that it is now 
present in the cRNA 3' sequence instead (vHM42). 

In another experiment the same procedure was used to isolate an 
influenza strain carrying 2x2 tandem target sites within the 5' and 
3' flanking positions of segment 8 vRNA, i.e. coding for genes NS1 and 
20 NS2 (vHM81; see SEQ.ID NO:4). And in principle the same could be done 
for any other influenza segment, in particular since only the reading frame 
cDNA sequence has been exchanged from HA to NS, with all of the 
flanking elements directly responsible for that procedure remaining in 
place, unchanged. 

25 

A.4: Ribozyme cleavage and vRNA segment exchange using ribozyme- 
sensitive influenza strains as helper viruses: 

In an initial model experiment a range of ribozyme type and target site 
locations was probed in designing a series of CAT reporter gene vRNA 
30 constructs (analysed in the presence of a surplus of wildtype helper virus) 
in 293T cells. While all of the ribozyme constructs adhered to the 
hammerhead model, with 10 to 12 nucleotides of complementary 
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sequence flanking on either side of the GUC target site, these ribozyme 
constructs varied from monomer to dimer to trimer repetitions. Ribozyme 
containing mRNAs were synthesized in vivo via the basic vector plasmid 
pSV2-A7eo, i.e. using the efficient p S ve RNA polymerase II promoter 

5 element for expression, and the SV40 origin signal for plasmid 
amplification, in a cell line (293T or cos-1) with an incorporated SV40 
T antigen gene. In addition the pSV2-neo mRNA includes the small, 63 
nucleotide intron sequence of the SV40 early mRNA which is supposed to 
be spliced very slowly, thereby extending the pre-mRNA half-live in the 

10 nucleus. Each of the pSV2-neo-ribozyme plasmid constructs was 
transfected into 293T cells. Thereafter, recombinant viruses containing 
dimer target sites either near one end of the molecule only, or near both 
ends have been used for infection of the transfected cells. Relative 
activities of ribozyme constructs versus vRNA target sites have been 

15 measured via inactivation of CAT acetyl transfer rates in the cell lysates 
obtained at 8 h post infection (Fig. 6). The highest activities were 
observed for dimer ribozymes acting on vRNA molecules carrying 2x2 
target sequences on both ends of the molecule, either in vRNA 5' and 3' 
location, or in vRNA 3' and cRNA 3' location, i.e. with an inversion of the 

20 target site sequence at the vRNA 5" end. 

Consequently, the two constructs carrying tandem ribozyme double 
targets within both of their non-translated vRNA flanking sequences have 
been used in the design of ribozyme-sensitive influenza virus strains as 
described above, with both variants isolated for segment 4 (HA), and one 

25 of them for segment 8 (NS). In complementary correspondence the 
hammerhead ribozyme plasmid used has also been constructed as a 
double-headed structure with flanking sequences as shown in Fig. 7 
(pAM424; for its detailed structure, see Fig. 16). 

30 The three target site-containing influenza strains isolated as described 
above have been analysed for their sensitivity against ribozyme cleavage 
by infection of 293T cells, which had been DN A- transfected 18 h earlier by 
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ribozyme-producing plasmid pAM424, at DNA-transfection rates between 
60 and 70 % (as observed in parallel transfections using GFP-expressing 
plasmid pAM505). Inactivation rates in these one-step control experiments 
were between 90% and 99% for all three ribozyme-sensitive strains, in 
5 their extent mainly depending upon the actual DNA-transfection rates 
achieved in individual experiments. 

In the next step both isolates of HA-coding ribozyme-sensitive viruses, 
vHM41 and vHM42, have been used in marker-rescue experiments. Here, 
293T cells have been first DNA transfected by HA-variant cDNA construct 

10 pHL2507 (see Fig. 17), followed after 18 h by vHM41 or vHM42 virus 
infections at moi 1. The resulting viral supernatant containing e.g. a 
mixture of ribozyme-sensitive vHM41 carrier virus and pHL2507/vHM41 
recombinant virus is propagated in an intermediate step on MDCK cells, 
which also results in an increase in the fraction of recombinant viruses. 

15 Thereafter the resulting virus-containing supernatant is passaged through 
293T (or cos-1) cells, which in advance have been transiently transfected 
by ribozyme-producing pAM424. As may be concluded from the above 
experiment (Fig. 6) and shown in Fig. 8 vHM41- or vHM42-derived 
ribozyme-sensitive HA vRNA segments are expected to be inactivated by 

20 pAM424- produced ribozyme down to a remaining level of about 1% to 
10% (mainly present within cells that are infected, but not DNA- 
transfected). 

Instead, the substitute HA vRNA which originated from pHL2507 plasmid 
25 DNA (ribozyme- resistant) becomes exclusively incorporated into progeny 
virions. For further purification and viral propagation these have been 
passaged a second time through 293T cells, again in advance DNA- 
transfected by pAM424, and after another amplification step on MDCK 
cells the resulting viral preparations have been used for RT-PCR analysis. 
30 The resulting viral populations in these marker rescue experiments only 
contain HA-vRNA molecules derived from pHL2507, which in their PCR 
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analyses are of intermediate size relative to wildtype HA-vRNA, and 
vHM41- or vHM42-derived HA-vRNAs, respectively. 

Consequently, a set of ribozyme-sensitive influenza strains with targets 
inserted individually into every vRNA molecule may be used for such one- 
5 step marker rescue experiments in general, i.e. for vRNA segment 
exchange reactions performed in a directional way for any of the eight 
influenza vRNA segments, without requirement for a selectable change in 
phenotype (genetic marker). 

10 B. Expression of two gene products from ambisense bicistronic 
influenza vRNA 

B.l: The influenza cRNA promoter is active in antisense viral mRNA 
transcription according to the cap-snatching mode of initiation: 

15 While the vRNA template of influenza virus is known to be active in viral 
mRNA as well as cRNA synthesis, the cRNA template has been described 
so far only to produce vRNA molecules, as a second step in viral 
replication. The potential activity of the cRNA promoter in initiating also 
viral mRNA transcription has not been analysed or even suspected so far, 

20 since no antisense (vRNA) reading frame can be detected in any of the 
viral RNA segments. Also, no Us/U 6 template sequence element is present 
in any of the viral cRNAs in an adjacent position to its 5' promoter 
structure as is the case for all viral vRNAs. This element is known to serve 
as a template sequence for mRNA terminal poly-adenylation, in repetitive 

25 interaction. However, when both elements are provided through 
reconstruction of an artificial influenza cRNA segment: a reading frame in 
opposite orientation (CAT), and a U 6 template element in 5' adjacent 
location, CAT expression can indeed be observed, see pHL2583 (see Fig. 
18) in Fig. 9. Similar to the vRNA promoter the cRNA promoter activity is 

30 improved by (the same) promoter-up mutations, which essentially consist 
of basepair exchanges according to the 'corkscrew' model. This model 
apparently also holds for the cRNA promoter structure as analysed in a 
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stepwise manner in Fig. 9. While the cRNA promoter has to be superior 
over the vRNA promoter in its initiation of replication, since the 
vRNA/cRNA product ratio was determined to be around 10:1 (Yamanaka 
et al., In vivo analysis of the promoter structure of the influenza virus RNA 
genome using a transfection system with an engineered RNA. Proc. Natl. 
Acad. Sci. USA 88, 5369-5373 (1991)), the cRNA promoter is observed to 
be inferior to the vRNA promoter in its initiation rate of transcription 
(compare pHL2583 with pHL1844 in Fig. 9), at least for all promoter 
variants tested so far. 

A RACE analysis for determination of the 5' ends in pHL2583 cRNA 
promoter transcribed mRNAs proved this initiation to occur according to 
the cap-snatching mode, in complete equivalence to standard vRNA 
promoter controlled transcription initiation. 

B.2: Development of ambisense influenza constructs for consecutive 
expression of two genes (GFP and CAT) from a single viral RNA: 
For a bidirectional transcription and translation of influenza RNA segments 
the two reporter genes GFP and CAT have been arranged in opposite 
orientation to each other, and the flanking 5' and 3' promoter sequences 
(adhering to promoter-up variant 1104) had to be reconstructed to include 
a U 6 poly-adenylation element in either orientation in a 5' promoter 
adjacent position. This requirement necessarily resulted in a promoter- 
adjacent 5'-U6/3'-A6 complementary structure, both in the vRNA and cRNA 
terminal sequence (see Fig. 10), which had to be tested for its promoter 
activity, in either orientation. Therefore, the convergent pair of reporter 
genes GFP and CAT has been inserted in both orientations, such that 
CAT transcription is initiated by the vRNA promoter in one construct 
(pHL2960), and by the cRNA promoter in the other (pHL2989, Fig. 19), 
and vice versa for GFP expression from both ambisense constructs. In 
addition, also the CAT gene only has been inserted in either orientation 
between the 5' and 3' elements of that ambisense promoter, with CAT 
transcribed by the vRNA promoter in one case (pHL2959), and by the 



WO 00/53786 PCT/EP00/01903 

26 

cRNA promoter in the other (pHL2957). The whole set of constructs allows 
for a direct comparison with corresponding reference constructs carrying a 
regular vRNA promoter (pHL1844) or cRNA promoter structure (pHL2583), 
i.e. carrying only the 5'-adjacent U6 sequence element and no 3'-A 6 
5 counterpart. The two groups of constructs also differ in insert size, since a 
single inserted gene roughly accounts for 750 nucleotides, the convergent 
set of two genes for 1500 nucleotides, with the distal half of both mRNAs 
in this case remaining untranslated, a situation unusual for influenza viral 
mRNAs. 

10 As is demonstrated in Fig. 10B for CAT expression of the various 
ambisense constructs all of them are able to initiate transcription in both 
orientations, even if at different levels with regard to their vRNA and cRNA 
promoter-dependent expressions, and also with regard to the insert 
lenghts and convergent arrangements of the GFP/CAT versus CAT-only 

15 constructs. Analysis of the GFP expression rates (not shown) yields 
complementary results, i.e. again vRNA promoter-controlled GFP 
expression is superior over cRNA promoter expression of GFP. Therefore, 
individual ambisense clones either show an asymmetric high expression of 
GFP and low expression of CAT (pHL2960) or vice versa (pHL2989), 

20 depending on their orientation of reading frames with regard to the 
external vRNA and cRNA promoter. Fig. 10C also demonstrates successful 
propagation of recombinant viruses containing ambisense RNA molecules, 
which proves survival through amplification, packaging into virions, and 
expression of both mRNAs in infected MDCK cells (including besides CAT 

25 also GFP expression). 

B.3: Construction of a superior promoter-up mutation, pHL1920, to be 
used for improved rates of cRNA promoter expression in ambisense 
constructs: 

30 An extended analysis of promoter variants, in particular of complementary 
double exchanges according to the 'corkscrew' model yielded among 
others variant pHL1920 (Fig. 20) with CAT activity rates considerably 
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above (125-130% of) the rates observed for standard promoter-up 
variant '1104' (as present in pHL1844). The '1920' promoter-up variant 
consists of altogether 5 nucleotide substitutions relative to the wildtype 
promoter sequence, both in the 5' promoter element (2), and the 

5 3' promoter element (3). The structure of this variant and the whole set of 
complementary double exchanges is presented in Fig. 11, together with 
the respective CAT activity measurements, in vRNA promoter constructs. 
vRNA promoter-up variants also show similarly improved expression in 
(ambisense) cRNA constructs, even if at generally lower levels than in 

10 vRNA constructs. cRNA promoter-up expression is observed at levels 
similar or somewhat (2x-5x) above the wild-type vRNA promoter rate, 
while vRNA promoter-up constructs show CAT expression rates increased 
up to 20 or 25 times the wild-type vRNA promoter level. In either case 
expression rates also depend on the size of the insert, with promoter 

15 activity rates decreasing with increasing lengths of the influenza RNA 
molecules to be transcribed. 

B.4: Influenza recombinant viruses containing a foreign gene (CAT) in 
covalent ambisense linkage with one of the viral genes (HA, NS1/NS2): 

20 The principle solution in designing stable recombinant viruses based on 
the new properties observed for influenza transcription and replication 
signals consists in constructing viruses which contain a foreign gene in 
covalent linkage with one of the (indispensable) viral genes, in ambisense 
bicistronic organization. Preferably the viral gene is connected to the cRNA 

25 promoter, while vRNA promoter expression is used for expression of the 
foreign gene at rates considerably above the viral mRNA synthesis. The 
promoter-up variant chosen for constructing the ambisense RNA segment 
intends to bring its cRNA promoter expression (approximately) into 
balance with all other viral gene expression levels, which are controlled by 

30 wild-type vRNA promoters located at the termini of the seven ordinary 
influenza segments; the respective choice has to take into consideration 
the overall length of the ambisense segment. 
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Isolation of the ambisense recombinant virus employs an RNA 
polymerase I-transcribed ambisense cDNA construct, which will give rise 
in vivo to ambisense cRNA-type molecules, see Fig. 12. The plasmid DNA 
transfection mixture used in this step with 293T cells in addition may or 

5 may not contain four 'booster' plasmids which under pcMv-control produce 
the four early influenza proteins from non-viral mRNAs: NP, plus PB1, PB2, 
and PA, i.e. the three subunits of viral polymerase (Pleschka et al., A 
plasmid-based reverse genetics system for influenza A virus J. Virol. 70, 
4188-4192 (1996)), which will increase in a pre-amplification step the 

10 copy number of that ambisense viral cRNA segment. At 18 h post 
transfection the 293T cells are infected by a ribozyme-sensitive influenza 
strain, e.g. vHM41, which will supply (again) early and also late viral 
RNAs. The resulting supernatant which contains a mixture of vHM41 
carrier virus and vHM41-derived ambisense recombinant virus (nine vRNA 

15 segments) is then passaged directly or via an intermediate step of 
amplification on MDCK cells onto 293T cells that have in advance been 
DNA-transfected by ribozyme-producing pAM424. Here, the ribozyme- 
sensitive vRNA segment of vHM41 will be cleaved at its 2x2 target sites by 
pAM424 specific ribozymes. In recombinant viruses the vRNA gene lost in 

20 this way is re-supplemented through its presence within the ambisense 
segment. The virus-containing supernatant is passaged for amplification 
and further purification through ribozyme treatment a second time on 
293T cells which again have been pretreated by pAM424 DNA transfection. 
Absence of ribozyme-sensitive vRNA, and presence only of ambisense RNA 

25 in RT-PCR analysis at this stage allows for further amplification on MDCK 
cells and a final virus stock preparation on embryonated chicken eggs. 
CAT assays can be used to analyse for the presence and monitor the 
activity of this model foreign gene through the various steps of isolation 
and propagation as well as document technical improvements that might 

30 be worked out for one or more of the processive stages. 
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C. Examples for application of helper-free, stable recombinant influenza 
viruses 

C.l: Incorporation of reporter gene GFP in NS/GFP or HA/GFP ambisense 
5 segments: 

Recombinant viruses of this type will allow to follow-up on influenza 
infection instantly and continuously in individual infected cells, which may 
also be counted or documented by fluorescence photography or FACS 
sorting. With improved temperature resistance (Siemering et al., 

10 Mutations that suppress the thermosensitivity of green fluorescent protein. 
Curr. Biol. 6, 1653-1663 (1993)) GFP expression becomes visible at 2 h 
after infection and shows bright fluorescence after 4 h p.i., it will be 
possible to follow-up on the spread of viral infection by GFP fluorescence. 
Stable fluorescence in abortively infected cells e. g. observed in ex vivo 

15 treatment of dendritic cells similarly supports a follow-up on their 
reincorporation into animals; other genes may be incorporated by 
ambisense vRNA into dendritic cells in the same way. 

C2: Construction of glycoprotein CSFV-E2 carrying influenza, helper- 
20 free: 

The glycoprotein E2 of CSF virus has been incorporated previously into 
influenza both as an HA-anchor fusion protein within the viral envelope, 
and as an additional, unstable ninth vRNA segment into its genome (Zhou 
et al.; Membrane-anchored incorporation of a foreign protein in 

25 recombinant influenza virions. Virology, 246, 83-94 (1998)). Stabilization 
is now achieved through an ambisense connection with either of the 
regular viral RNA segments (NS or HA) which also allowed to reach a level 
of 100% recombinant viruses instead of an hitherto only 20% (Fig. 13), 
since all carrier viruses are destroyed through ribozyme action, see Figs. 8 

30 and 12. The helper virus containing preparation has already been used 
successsfully as a vaccine against CSFV infection (antibody titers of 
1:40000); the increase achieved in recombinant viruses allows a further 
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improvement in that regard. Also (cost-effective) propagation in fertilized 
chicken eggs has become possible due to its stable incorporation of the 
foreign gene as a covalent ambisense construct. 

5 C.3: Construction of hepatitis C glycoprotein-carrying recombinant 
influenza viruses as a candidate vaccine: 

Hepatitis C virus is a close relative of CSF virus (hog cholera virus), and in 
particular its set of two glycoproteins, small-size El and larger-size E2, is 
closely related in structural detail and presumably also in function to the 

10 corresponding CSFV proteins. An incorporation of HCV-E2/HA fusion 
proteins into influenza viral envelopes has been achieved in analogy to the 
CSFV-E2/HA incorporation. In addition, incorporation of an anchor-fusion 
glycoprotein HCV-E1/HA or both together (in NS and HA ambisense 
junctions) allows further variations in constructing an influenza-based 

15 vaccine for hepatitis C. In analogy to CSFV-E2, neutralizing antibodies are 
expected to be directed against particular epitopes of HCV-E2, presented 
in essentially native conformation at the influenza viral envelope. 

C.4: Stable incorporation of selected influenza T-cell epitopes in 

20 ambisense constructions: 

Influenza infection is known to result in both, antibody production against 
that specific viral strain or indeed its epitopes that are located mainly at 
the surface of HA, and in an increase of specifically primed cytotoxic T- 
lymphocytes, stimulated by T-cell epitopes primarily located within the 

25 essentially invariable core structure of the NP protein. While the humoral 
response will result in life-long immunity against that particular strain of 
influenza or its epitope structures, the T-cell response will be lost or 
severely reduced some time afterwards, such that its specificity against 
influenza in general will fall below protective levels. One way in trying to 

30 increase that level of cellular immunity is to enhance the response or 
recruitment of influenza-specific CTL cells by increasing the level of T-cell 
epitopes in the infected cell and hence its presentation on the surface by 
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MHC-I receptors. This is achieved by combining in an ambisense construct 
the HA gene and a series of repeated T-cell epitope sequences as present 
in the influenza NP gene, in a model design specific for the BALB/C mouse 
MHC-I allele. Here, the promoter-up expression rate is realized (in vRNA 

5 promoter-controlled initiation) for expression of the repetitive epitope 
polypeptide chain. Alternatively or in addition a controlled secretion of an 
interleukin can be achieved from recombinant influenza-infected cells, 
upon ambisense incorporation of the respective gene preferrably into the 
NS segment. The interleukin to be chosen for this purpose (IL-12 or other) 

10 is selected to enhance the longevity of influenza-specific CTL cells or its 
conversion into corresponding memory cells. In this way an ambisense 
vaccine strain against influenza itself is achieved with expected protective 
capacity against influenza in general. 

15 C.5: Exchange of influenza glycoproteins against foreign viral 
glycoproteins (VSV-G): 

The 'marker rescue' experiment described above (section A.4.), i.e. an 
exchange of one HA gene (ribozyme-sensitive) for another (ribozyme- 
resistant) can also be used for an exchange of HA for an entirely different 

20 glycoprotein, such as the vesicular stomatitis virus G protein, as long as it 
is attached to the HA anchor segment. Effective incorporation depends 
always on that C-terminal tail sequence and its interaction with underlying 
matrix protein Ml, and therefore, all constructions consist of fusion 
proteins in direct analogy to CSFV-E2. VSV-G with or without a foreign 

25 anchor sequence has been shown in several other viruses to be able to 
substitute for the original glycoprotein and to result in infectious viruses 
with VSV-G specific host-ranges (e.g. in retroviruses, rabies virus, measles 
virus). G protein in VSV itself as well as on the surface of foreign viruses is 
the only glycoprotein required for all of the consecutive steps in infection. 

30 Insertion of VSV-G instead of HA in recombinant influenza viruses leaves 
the second glycoprotein, neuraminidase, without any function, which then 
will get lost spontaneously from the recombinant viruses. This will further 
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increase the capacity for an addition of foreign genes, beyond the gain 
resulting from an exchange of the larger HA for the smaller VSV-G, which 
might be used for an addition of ambisense constructs. 

5 The invention is further illustrated in the accompanying figures. 

Detailed Description of the Figures 

Fia.l: 3' nucleotide extensions of influenza vRNA template molecules: 

10 (A) Murine B82 cells have been transfected by plasmid cDNA constructs 
designed to be transcribed into influenza vRNA molecules by RNA 
polymerase I in vivo, followed after 20 h by standard FPV Br atisiava helper 
virus infection, at an moi of 1 to 3. In addition to reference plasmid 
pHL2024 (no extension), related cDNA constructs carrying extensions of 1 

15 to 50 bp, and hence extended by 1 to 50 nucleotides at the resulting vRNA 
3 1 ends were used in parallel transfections; template extensions are 
marked at the top of the figure. Cell lysates prepared at 8 h post helper 
virus infections were used for CAT reactions using in one round 50 pi of 
cell lysate each, and in further analyses 5pl and 0.5 pi of lysate (not 

20 shown). Relative yields were determined in comparison to reference 
plasmid pHL2024, as indicated below the figure, with calculations 
restricted to those CAT assays showing less than 40% of substrate 
consumption, in three or more independent experiments. 

(B) Viral passage of B82 supernatants containing recombinant influenza 
25 virus onto MDCK cells, at an moi of 2 to 4, in average. Again at 8 h post 
infection cell lysates have been prepared and used for CAT assays. 
Relative yields as indicated below the figure have been determined in 
comparison to pHL2024 the same way as in (A), using 0.5 pi of cell lysate 
in each case. (The 50 pi CAT assays as shown here and also in the 
30 following figures intend to give an immediate impression of relative 
activities at always the same level, while the actual measurement data as 



WO 00/53786 PCT/EP00/01903 

33 

indicated below the lanes are obtained at various appropriate enzyme 
concentrations relative to reference pHL2024.) 

Fig. 2: Propagation of recombinant influenza viruses with tandem 
5 bicistronic vRNA: 

(A) General design of expression plasmids for transient bicistronic vRNAs 
coding for GFP in the mRNA-proximal, and for CAT in the mRNA-distal 
position. Among the functional elements indicated are the human RNA 
polymerase I promoter (p Ih ) and murine rDNA terminator (ti) sequences, 
10 both hatched, and the 5' and 3' vRNA promoter cDNA sequences, open 
and closed boxes, respectively. For the vRNA-internal 3' promoter signal 
three variant sequences have been inserted as indicated below (positions ~ 
1 to 15 refer to 3'-terminal nucleotides in the resulting monocistronic 
vRNAs). 

15 (B) CAT assays as determined relative to pHL1844 (monocistronic CAT 
construct) after DNA transfection of 293T cells plus helper virus infection 
followed by one round of progeny viral propagation on MDCK cells are 
indicated below the lanes. Relative activities of the internal promoter 
sequences as indicated above the figure refer to measurements in a 

20 monocistronic external location of the same promoter variants (Flick et al., 
Promoter elements in the influenza vRNA terminal structure, RNA 2, 1046- 
1057 (1996)). Control clone pHL2300 contains an unrelated, non- 
functional sequence in the central location in an otherwise identical 
plasmid construct. 

25 

Fig. 3: Tande m bicistronic vRNA sup-portino an alternative mode of tran- 
scription and replication initiation: 

An additional internal 3' promoter sequence has been inserted in between 
both cistrons, in a vRNA-central position. Left half: bicistronic replication 
30 and transcription leading to (proximal) GFP expression. Right half: internal 
initiation resulting in monocistronic replication and transcription leading to 
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(distal) CAT expression, and causing deletion of the GFP sequence from 
progeny molecules. 

Fia. 4: Outgrowth of promoter-up recombinant vRNA versus wildtype 
5 vRNA segments in stepwise propagation of influenza virus: 

RNA polymerase I transcription of transfected- pHL2969 DNA results in 
influenza vRNA carrying (an external) promoter-up mutant '1104', and 
containing 2x2 ribozyme targets in flanking positions relative to its HA 
coding sequence. Another HA vRNA segment in wild-type configuration 
10 and originating from infecting FPV helper virus is also present in the 
recombinant virus preparation, initially (lane 1; 293T lysate) in surplus 
amounts, but reduced and finally lost entirely in consecutive steps of 
propagation (lanes 2 to 4; MDCK cell lysates), and in isolated strains after 
pAM403 ribozyme treatment for removal of the external '1104' promoter 
15 sequence (lane 5). Determination throughout by RT-PCR analyses using a 
pair of primers extending across the 5* inserted target site sequence, with 
435 bp representing the recombinant HA segment, and 306 bp the wild- 
type sequence without an inserted target site sequence. 

20 Fio.5: PAM403 ribozvme cleavage of PHL2969 derived vRNA molecules at 
specific target sites inserted between an external and an internal 
3' promoter sequence: 

The external promoter-up ('1104') signal is used for vRNA amplification 
within recombinant viruses and reduction of helper virus HA vRNA (Fig. 4), 

25 while the 'switch' to an internal wild-type signal guarantees stable 
replication of recombinant viruses. pAM403 hammerhead ribozyme RNAs 
are indicated in complementary binding to their target site sequences (12 
and 10 nucleotides flanking the GU'C cleavage point) by straight lines 
flanking a central secondary structure symbol. vRNA-internal 2x2 

30 ribozyme targets are marked by xx (see Fig. 7). 
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Fig. 6: Comparative cleavage analysis of model CAT vRNAs with tandem 
target sites in various flanking positions, bv taraet-specific ribozvmes: 
293T tissue culture cells have been transiently DNA-transfected either by 
a single-headed hammerhead ribozyme (s), or a double-headed (d), or 

5 triple-headed (t) ribozyme cDNA construct, all specifically designed to 
hybridize to a tandem dimer target site sequence inserted in flanking 
positions into the CAT vRNA. All ribozyme RNAs have been expressed from 
the same pSV2-neo plasmid vector, including a pSV2-neo control 
construct without an inserted ribozyme cDNA sequence (c). At 20 h after 

10 DNA transfection (which reached 65% yield as measured by pcmv-GFP 
transfection in parallel of the same cell culture) the 293T cells were 
infected by CAT recombinant viruses carrying tandem double target 
sequences either only in vRNA-3' position, or in both vRNA-3' and 5' 
positions, or in both vRNA-3' and cRNA-3' positions. Most effective among 

15 the s, d, or t-ribozymes were double-headed constructs, acting on 2x2 
targets inserted in either of the two localizations described (lanes 6 and 
10). 



Fio. 7: Alignment of PAM424 double-headed ribozvme with one of their 
20 repetitive target seouences located within the 5' and 3' vRNA flanking 
regions: 

The superior activity of ribozymes oriented against targets located in the 
3' end of vRNA molecules over those present in the 5" end instead (not 
shown) is in agreement with the model for influenza vRNA transcription 

25 and replication (Lamb and Krug, Orthomyxoviridae: The viruses and their 
replication. In 'Virology' (B.N. Fields, D.M.Knipe, P.M.Howley, R.M.Chanock, 
J.L.Melnick, T.P.Monath, B.Roizman, and S.E.Straus, Eds.), 3 rd ed., Vol. 1, 
pp. 1353-1395. Lippincott-Raven, Philadelphia (1996)), according to which 
influenza polymerase stays attached to the 5' end of the vRNA molecule 

30 throughout the entire or even several rounds of transcription, whereas the 
very 3' end repeatedly, in every initiation reaction serves as the template 
sequence, and consequently is no longer covered by polymerase. 
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Superiority of a double-headed over a single-headed ribozyme has been 
determined earlier in this laboratory (A.Menke, Anti-Influenza Ribozyme: 
vRNA-Spaltung und intrazellulare Aktivitat. Dissertation Universitat 
Giessen (1997)), but the substantial increase of vRNA inactivation rates 
5 upon incorporation of tandem target sites at both ends of the vRNA 
molecule instead of only one has been observed here for the first time, 
within that overall design. 

Fig. 8: pAM424 ribozyme cleavage of resistant FPV wild-type HA vRNA and 
10 ribozyme-sensitive pHL2969-derived HA-vRNA in 293T cells infected by 
vHM41 after isolation from pHL2969-recombinant viral preparations. 
Lane 1: FPV infection of 293T cells, untreated; lane 2: FPV infection of 
293T cells DNA-transfected by pAM424; lane 3: vHM41 infection of 293T 
cells, untreated; lane 4: vHM41 infection of 293T cells DNA-transfected by 
15 pAM424. RT-PCR analyses of purified viral progeny as in Fig. 4. 

Fia. 9: Functional analysis of the influenza cRNA promoter structure: 

(A) Schematic cRNA promoter ('1104') secondary structure according to 
the 'corkscrew' model; nucleotides involved in single or double nucleotide 

20 exchange are marked by their position. 

(B) CAT analyses of 293T cell lysates after DNA transfection and FPV 
helper virus infection of cRNA promoter variants, in comparison to 
standard vRNA promoter-up mutant '1104' (pHL1844). Nucleotide 
substitutions divergent from the basic '1104' structure as present in 

25 pHL2583 or pHL2721 (see above) are indicated above the lanes, positions 
3 or 8 as marked by a bar refer to cRNA positions counted from the 
3' end. Relative CAT activities are marked below the lanes. 

Fig. 10: Functional analysis of the vRNA and cRNA promoter in ambisense 



30 arrangement: 
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(A) Sequence organisation of the ambisense promoter cDNA construct 
carrying T 6 /A 6 elements adjacent to the terminal sequence, and secondary 
structure predictions for the resulting cRNA and vRNA promoter signal. 

(B) CAT expression data obtained from the cell lysates of 293T cell after 
5 plasmid DNA transfection and FPV infection, and (C) from cell lysates of 

MDCK cells after one step of viral passage. Indicated above the lanes are 
promoter/gene conjunctions: v = vRNA promoter; c = cRNA promoter. 

Fia.ll: Basepair substitutions according to the vRNA 'corkscrew' structure: 
10 (A) 'Corkscrew' conformation of the vRNA promoter drawn against a 
schematic indication of interacting tripartite viral polymerase. Paired 
positions exchanged in individual experiments are indicated by numbers, 
nucleotides 3 or 8 are counted from the 3' end. pHL2024 containing 
promoter-up mutation '1104' is used as the reference construct ( = 100%) 
15 in all of the CAT assays, while pHL2428 represents the wild-type promoter 
structure. 

(B) CAT expression data obtained after one step of viral passage in MDCK 
undiluted, and 50 fold diluted. 

20 Fig. 12: Flow-chart of the isolation procedure for an ambisense 
recombinant influenza virus. 

Fig. 13: Immuno-electron microscopy of purified influenza FPV/CSFV-E2- 
HA virions: 

25 Recombinant viruses exposing the foreign glycoprotein CSFV-E2 in their 
envelopes, which has been fused onto the HA anchor domain, are marked 
by anti-E2 monospecific antibody and by secondary gold-labelled (5nm) 
goat antibody. Recombinant viruses (16%) are present together with 
their FPV helper viruses. 

30 

Fig. 14: pHL2969; the exact sequence of the 4930 bps circular DNA is 
shown in SEQ. ID NO:l. 
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Fig. 15: pAM403; the exact sequence of the 5811 bps circular DNA is 
shown in SEQ. ID NO:2. 

Fig. 16: pAM424; the exact sequence of the 5860 bps circular DNA is 
shown in SEQ. ID NO:5. 

Fig- 17: pHL2507; the exact sequence of the 4610 bps circular DNA is 
shown in SEQ. ID NO:6. 

F'Q- 18: pHL2583; the exact sequence of the 3558 bps circular DNA is 
shown in SEQ. ID NO:7. 

Fia. 19: pHL2989; the exact sequence of the 4343 bps circular DNA is 
shown in SEQ. ID NO:8. 

Fig. 20: pHL1920; the exact sequence of the 3888 bps circular DNA is 
shown in SEQ. ID NO:9. 

Hence, the present invention is based on two surprising findings, namely 

1. influenza virus promoters are active when present internally in a gene; 

2. the so-called cRNA, thought to be an intermediate in replication can be 
turned into a protein-encoding RNA by equipping it with a variant influenza 
virus promoter, described in the present invention. 

These two observations were used to make ambisense constructs. This 
allows to package an additional, foreign gene into influenza virus particles. 
Such particles were made previously, by other methods, but proved to be 
unstable, and therefore useless. For use as a vaccine for example, a 
helper virus would have been needed as a stabilizer. Stabilization in the 
present invention is achieved by several means. These include the 
"balancing" of one of the two promoters in the ambisense biscistronic 
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genetic construct with seven other vRNA wildtype promoters, while the 
additional promoter is used for high-rate expression of the foreign gene at 
various levels. 

5 Thus, the present invention provides a system for expression of foreign 
proteins in higher eukaryotic systems. One system in particular is 
interesting, namely embryonated chicken eggs, as it allows cost-effective 
production in an automatable way (as used by most flu vaccine 
producers). The reason that this process is now possible, is that the 
10 foreign protein is part of a stable, engineered influenza virus particle. The 
virus can be designed also to rapidly monitor process improvements. 

An excellent use is of course the use of the construct as a vaccine. The 
influenza virus particle is immunogenic and can now be equipped with 

15 foreign antigens, enabling for example the design and production of 
hepatitis C virus and HIV vaccines, but also of tumor vaccines. As the 
present invention shows, the foreign antigenic surface glycoprotein is 
"fused" to a C-terminal segment of influenza HA, and the antigen then is 
presented at the surface of influenza virus particles. In addition, these 

20 vaccines can now be made in the way standard flu vaccines are made, i.e., 
in embryonated chicken eggs. 
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Claims 

1. A recombinant influenza virus for high-yield expression of incorporated 
5 foreign gene(s), which is genetically stable in the absence of any helper 

virus and which comprises at least one viral RNA segment being an 
ambisense RNA molecule (ambisense RNA segment) and containing one of 
the standard viral genes in sense orientation and a foreign, recombinant 
gene in anti-sense orientation, or wee versa, in overall convergent 
10 arrangement. 

2. The recombinant influenza virus of claim 1, wherein at least one of the 
regular viral RNA segments is replaced by an ambisense RNA segment 
which contains one of the standard viral genes in sense orientation and a 

15 foreign, recombinant gene in anti-sense orientation, or vice versa, in 
overall convergent arrangement. 

3. The recombinant virus according to claims 1 and 2, wherein in the 
ambisense RNA molecule said foreign recombinant gene is covalently 

20 bound to one of the viral genes, while the original vRNA segment coding 
for the same gene is deleted from the recombinant virus by way of specific 
ribozyme cleavage. 

4. The recombinant influenza virus according to claims 1 to 3, wherein 
25 one or more of the regular viral RNA segments, differing from said at least 

one ambisense RNA segment, comprises a vRNA encoding a foreign gene, 
preferably one or more of the regular viral RNA segments has (have) been 
exchanged for a vRNA encoding a foreign gene. 

30 5. The recombinant influenza virus according to claim 4 in which one or 
both of the standard glycoproteins hemagglutinin and neuraminidase have 
been exchanged into foreign glycoprotein (s) or into fusion glycoproteins 
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consisting of an anchor segment derived from hemagglutinin and an 
ectodomain obtained from the foreign source, viral or cellular, or in which 
such recombinant glycoprotein has been inserted as a third molecular 
species in addition to the remaining standard components. 

6. The recombinant influenza virus according to claims 1 to 5, in which 
the terminal viral RNA sequences of one or more of the regular segments 
and/or of the at least one ambisense RNA segment, which are active as 
the promoter signal, have been modified by nucleotide substitutions in up 
to five positions, resulting in improved transcription rates of both the 
vRNA promoter as well as the cRNA promoter as present in the 
complementary sequence. 

7. The recombinant influenza virus of claim 6, wherein the 12 nucleotide 
conserved influenza 3' terminal sequence has been modified by 
replacement of one to three nucleotides occurring in said sequence at 
positions 3, 5 and 8 relative to the 3' end by other nucleotides, and/or 
wherein the 13 nucleotide conserved influenza 5' terminal sequence has 
been modified by replacement of one or two nucleotides occurring in said 
sequence at positions 3 and 8 by other nucleotides. 

8. The recombinant influenza virus of claim 7, wherein the replacements 
in the 3' terminal nucleotide sequence comprises the modifications G3A 
and C8U. 

9. The recombinant influenza virus of claim 8, wherein the replacements 
in the 3' terminal nucleotide sequence comprises the modifications G3A, 
U5C and C8U, or G3C, U5C and C8G. 

10. The recombinant influenza virus of claim 9, which comprises a 3' 
terminal nucleotide sequence of S'-CCUGUUUCUACU-S'. 
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11. The rcombinant influenza virus of claims 7 to 10, wherein the 5" 
terminal nucleotide sequence comprises the modifications U3A and A8U 
resulting in a 5'-terminal sequence of 5'-AGAAGAAUCAAGG. 

12. The recombinant influenza virus according to claims 1 to 11, which is a 
recombinant influenza A virus. 



13. The recombinant influenza virus according to claims 1 to 12, in which 
the foreign gene(s) in ambisense covalent junction with viral gene(s) code 
for proteins and/or glycoproteins which are secreted from cells infected 
with the recombinant virus. 



14. The recombinant virus according to claims 1 to 12, in which the foreign 
gene(s) in ambisense covalent junction with viral gene(s) code for proteins 
or artificial polypeptides designed to support an efficient presentation of 
inherent epitopes at the surface of infected cells, for stimulation of a B cell 
and/or T cell response. 

15. A method for the production of recombinant influenza viruses as 
defined in claims 1 to 14 comprising 

(a) RNA polymerase I synthesis of recombinant vRNAs in vivo, in 
antisense, sense or ambisense design, 

(b) followed by infection with an influenza carrier strain constructed to 
include flanking ribozyme target sequences in at least one of its viral RNA 
segments, and 

(c) thereafter selective vRNA inactivation through ribozyme cleavage. 

16. A method of constructing influenza carrier strains carrying one or 
more ribozyme target sites (of type one) in vRNA flanking positions 
comprising 

(a) RNA polymerase I synthesis of recombinant vRNAs in vivo, carrying 
two different 3' promoter sequences in tandem, which are separated by a 
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second type of ribozyme target sequence, and which carry the said 
internal ribozyme target sites of type one; 

(b) followed by infection of an influenza wildtype strain; 

(c) thereafter amplification through simple steps of viral propagation; and 

(d) finally isolation through removal of their external 3' promoter 
sequence by ribozyme cleavage through infection of cells expressing 
ribozyme type 2, followed by plaque purification. 

17. A ribozyme-sensitive influenza carrier strain obtainable by the method 
of claim 16. 

18. A pharmaceutical composition comprising a recombinant influenza 
virus according to claims 1 to 14. 

19. Use of a recombinant influenza virus according to claims 1 to 14 for 
preparing a medicament for vaccination purposes. 

20. The use according to claim 19, wherein the medicament 

(a) is suitable against influenza and/or against other infections; 

(b) is present in form of inactivated preparations; and/or 

(c) is present in form of live recombinant viruses. 

21. Use of a recombinant influenza virus according to claims 1 to 14 for 
preparing agents for somatic gene therapy. 

22. Use of a recombinant influenza virus according to claims 1 to 14 for 
preparing agents, for transfer and expression of foreign genes into cells 
infected by such viruses. 

23. Use of a recombinant influenza virus according to claims 1 to 14 for 
preparing agents for transfer and expression of RNA molecules into cells 
infected by such viruses. 
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24. The use of claim 23, wherein the RNA molecules to be expressed are 
antisense sequences or double-strand sequences relative to the target cell 
cellular mRNA molcules, and/or the agent is suitable for sequence-specific 
gene silencing, preferably by antisense RNAor RNA interference 
mechanisms. 

25. The use according to claims 21 to 24, wherein the agents are 
applicable in ex vivo and in vivo application schemes. 

26. A method for the production of proteins or glycoproteins which 
comprises utilizing a recombinant influenza virus according to claims 1 to 
14 as expression vector. 

27. The method of claim 26, wherein the production is performed in cell 
culture cells or in fertilized chicken eggs. 

28. A method for preventing and/or treating influenza which comprises 
administering an effective amount of a recombinant influenza virus 
according to claims 1 to 14 to the mammal to be treated. 

29. A method for somatic gene therapy, which method comprises 
subjecting the organism to be treated with a recombinant influenza virus 
according to claims 1 to 14. 

30. A method for transfer and expression of foreign genes into cells, and 
for transfer and expression of RNA molecules into cells, which method 
comprises infecting the cells with a recombinant influenza virus according 
to claims 1 to 14. 

31. Use of a recombinant influenza virus according to claims 1 to 14 for 
preparing agents for autologous immunotherapy. 
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32. A method for an immunotherapy which comprises ex vivo infection of 
immune cells with a recombinant influenza virus according to claims 1 to 
14, and introduction of the transduced cells into the patient. 

33. A method for the induction of antibodies which comprises utilizing a 
recombinant influenza virus according to claims 1 to 14 as an immunogen. 
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SEQUENCE LISTING 

<110> ARTEMIS PHARMACEUTICALS GmbH 

5 <120> Stable Recombinant Influenza Viruses Free of Helper 
Viruses 

<130> 000520wo/JH/ml 

10 <140> 
<141> 

<160> 26 

15 <170> Patentln Ver. 2.1 



20 



25 



30 



35 



40 



45 



50 



55 



60 



<210> 1 
<211> 4930 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 



pHL2969 



<400> 1 

cgtacgaagc 

agacgaaaaa 

ttagtgaggg 

taataacccg 

ggaggtcgcg 

ccaaaaacgc 

gacacacgcg 

gggggccggc 

gaccgggaca 

ctcagggagc 

aatgaatcag 

aaccgtaaaa 

cacaaaaatc 

gcgtttcccc 

tacctgtccg 

tatctcagtt 

cagcccgacc 

gacttatcgc 

ggtgctacag 

ggtatctgcg 

ggcaaacaaa 

agaaaaaaag 

aacgaaaact 

atccttttaa 

tctgacagtt 

tcatccatag 

tctggcccca 

gcaataaacc 

tccatccagt 

ttgcgcaacg 

gcttcattca 

aaaaaagcgg 

ttatcactca 

tgcttttctg 

ccgagttgct 



ttctagaggg 
catatgctag 
ttaattgcgc 
gcggcccaaa 
tcaccgacca 
caccatcgca 
ggagcagcgc 
ccgtgtctcc 
cacgagggac 
gctctccgac 
gggataacgc 
aggccgcgtt 
gacgctcaag 
ctggaagctc 
cctttctccc 
cggtgtaggt 
gctgcgcctt 
cactggcagc 
agttcttgaa 
ctctgctgaa 
ccaccgctgg 
gatctcaaga 
cacgttaagg 
attaaaaatg 
accaatgctt 
ttgcctgact 
gtgctgcaat 
agccagccgg 
ctattaattg 
ttgttgccat 
gctccggttc 
ttagctcctt 
tggttatggc 
tgactggtga 
cttgcccggc 



attggctgag 
agcggccgcc 
gcaggcctag 
atgccgactc 
cgccgccggc 
gccacacacg 
cgggccgggg 
agagcgggag 
cgaaggccgg 
tccgcacggg 
aggaaagaac 
gctggcgttt 
tcagaggtgg 
cctcgtgcgc 
ttcgggaagc 
cgttcgctcc 
atccggtaac 
agccactggt 
gtggtggcct 
gccagttacc 
tagcggtggt 
agatcctttg 
gattttggtc 
aagttttaaa 
aatcagtgag 
ccccgtcgtg 
gataccgcga 
aagggccgag 
ttgccgggaa 
tgctacaggc 
ccaacgatca 
cggtcctccg 
agcactgcat 
gtactcaacc 
gtcaacacgg 



acgaaaaaca 
accgcggtgg 
ctaggtaaag 
ggagcgaaag 
ccaggcgacg 
gagcgcccgg 
acgccctccc 
ccggaagcat 
ccaggcgcga 
gactcgccag 
atgtgagcaa 
ttccataggc 
cgaaacccga 
tctcctgttc 
gtggcgcttt 
aagctgggct 
tatcgtcttg 
aacaggatta 
aactacggct 
ttcggaaaaa 
ttttttgttt 
atcttttcta 
atgagattat 
tcaatctaaa 
gcacctatct 
tagataacta 
gacccacgct 
cgcagaagtg 
gctagagtaa 
atcgtggtgt 
aggcgagtta 
atcgttgtca 
aattctctta 
aagtcattct 
gataataccg 



tatgctagag 
agctccagct 
aaaaataccc 
atatacctcc 
cgcgacacgg 
ggccctctgg 
ggccgcccgt 
tttcggccgg 
cctctcgggc 
aaaggatcgt 
aaggccagca 
tccgcccccc 
caggactata 
cgaccctgcc 
ctcatagctc 
gtgtgcacga 
agtccaaccc 
gcagagcgag 
acactagaag 
gagttggtag 
gcaagcagca 
cggggtctga 
caaaaaggat 
gtatatatga 
cagcgatctg 
cgatacggga 
caccggctcc 
gtcctgcaac 
gtagttcgcc 
cacgctcgtc 
catgatcccc 
gaagtaagtt 
ctgtcatgcc 
gagaatagtg 
cgccacatag 



ggattggctg 
tttgttccct 
ttgtttctac 
cccggggccg 
acacctgtcc 
tcaaccccag 
gccacacgca 
cccctcctac 
cgcacgcgcg 
gacctgcatt 
aaaggccagg 
tgacgagcat 
aagataccag 
gcttaccgga 
acgctgtagg 
accccccgtt 
ggtaagacac 
gtatgtaggc 
gacagtattt 
ctcttgatcc 
gattacgcgc 
cgctcagtgg 
cttcacctag 
gtaaacttgg 
tctatttcgt 
gggcttacca 
agatttatca 
tttatccgcc 
agttaatagt 
gtttggtatg 
catgttgtgc 
ggccgcagtg 
atccgtaaga 
tatgcggcga 
cagaacttta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 
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1800 

1860 

1920 

1980 

2040 
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aaagtgctca tcattggaaa acgttcttcg gggcgaaaac tctcaaggat cttaccgctg 2160 
ttgagatcca gttcgatgta acccactcgt gcacccaact gatcttcagc atcttttact 2220 
ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa atgccgcaaa aaagggaata 2280 
agggcgacac ggaaatgttg aatactcata ctcttccttt ttcaatatta ttgaagcatt 2340 
5 tatcagggtt attgtctcat gagcggatac atatttgaat gtatttagaa aaataaacaa 2400 
aagagtttgt agaaacgcaa aaaggccatc cgtcaggatg gccttctgct taatttgatg 2460 
cctggcagtt tatggcgggc gtcctgcccg ccaccctccg ggccgttgct tcgcaacgtt 2520 
caaatccgct cccggcggat ttgtcctact caggagagcg ttcaccgaca aacaacagat 2580 
aaaacgaaag gcccagtctt tcgactgagc ctttcgtttt atttgatgcc tggcagttcc 2640 

10 ctactctcgc atggggagac cccacactac catcggcgct acggcgtttc acttctgagt 2700 
tcggcatggg gtcaggtggg accaccgcgc tactgccgcc aggcaaattc tgttttatca 2760 
gaccgcttct gcgttctgat ttaatctgta tcaggctgaa aatcttctct catccgccaa 2820 
aacagaagct agcggccgat ccccaaaaaa aaaaaaaaaa aaaaaaaaaa gagtccagag 2880 
tggccccgcc gttccgcgcc gggggggggg gggggggggg acactttcgg acatctggtc 2940 

15 gacctccagc atcgggggaa aaaaaaaaaa caaagtttcg cccggagtac tggtcgacct 3000 
ccgaagttgg gggggagtag aaacagggta gataatcact cactgacgta cgttgagcaa 3060 
ctgactgaaa tgccttgagc aactgactga aatgcctgac gtctttagca aaagcagggt 3120 
agataatcac tcactgagtg acatccacat cgtaccagga ttggctgaga cgaaaaacat 3180 
attgtaccag ggattggctg agacgaaaaa catattgtag gtaccaaaat gaacactcaa 324 0 

20 atcctggttt tcgcccttgc ggcagtcatc cccacaaatg cagacaaaat. ttgtcttgga 3300 
catcatgctg tatcaaatgg caccaaagta aacacactca ctgagagagg agtagaagtt 3360 
gtcaatgcaa cggaaacagt ggagcggaca aacatcccca aaatttgctc aaaagggaaa 3420 
agaaccactg atcttggcca atgcggactg ttagggacca ttaccggacc acctcaatgc 3480 
gaccaatttc tagaattttc agctgatcta ataatcgaga gacgagaagg aaatgatgtt 3540 

25 tgttacccgg ggaagtttgt taatgaagag gcattgcgac aaatcctcag aggatcaggt 3600 
gggattgaca aagaaacaat gggattcaca tatagtggaa taaggaccaa cggaacaact 3660 
agtgcatgta gaagatcagg gtcttcattc tatgcagaaa tggagtggct cctgtcaaat 3720 
acagacaatg cttctttccc acaaatgaca aaatcataca aaaacacagg gagagaatca 3780 
gctctgatag tctggggaat ccaccattca ggatcaacca ccgaacagac caaactatat 3840 

30 gggagtggaa ataaactgat aacagtcggg agttccaaat atcatcaatc ttttgtgccg 3900 
agtccaggaa cacgaccgca gataaatggc cggtccggac ggattgattt tcattggttg 3 960 
atcttggatc ccaatgatac agttactttt agtttcaatg gggctttcat agctccaaat 4 020 
cgtgccagct tcttgagggg aaagtccatg gggatccaga gcgatgtgca ggttgatgct 4 080 
aattgcgaag gggaatgcta ccacagtgga gggactataa caagcagatt gccttttcaa 4140 

35 aacataaata gcagagcagt tggcaaatgc ccaagatatg taaaacagga aagtttatta 4200 
ttggcaactg ggatgaagaa cgttcccgaa ccttccaaaa aaaggaaaaa aagaggcctg 4260 
tttggtgcta tagcagggtt tattgaaaat ggttgggaag gtctggtcga cgggtggtac 4 320 
ggtttcaggc atcagaatgc acaaggagaa ggaactgcag cagactacaa aagcacccaa 4380 
tcggcaattg atcagataac cggaaagtta aatagactca ttaagaaaac caaccagcaa 4440 

40 tttgagctaa tagataatga attcactgaa gtggaaaagc agattggcaa tttaattaac 4 500 
tggaccaaag actccatcac agaagtatgg tcttacaatg ctgaacttct tgtggcaatg 4560 
gaaaaccagc acactattga tttggctgat tcagagatga acaagctgta tgagcgagtg 4 620 
aggaaacaat taagggaaaa tgctgaagag gatggcactg gttgctttga aatttttcat 4 680 
aaatgtgacg atgattgtat ggctagtata aggaacaata cttatgatca cagcaaatac 4740 

45 agagaagaag cgatgcaaaa tagaatacaa attgacccag tcaaattgag tagtggctac 4800 
aaagatgtga tactttggtt tagcttcggg gcatcatgct ttttgcttct tgccattgca 4860 
atgggccttg ttttcatatg tgtgaagaac ggaaacatgc ggtgcactat atgcatttaa 4920 
agcttgcatg 4 930 

50 

<210> 2 
<211> 5811 
<212> DNA 

<213> Artificial Sequence 

55 

<220> 

<223> Description of Artificial Sequence: pAM403 
<400> 2 

60 aattcctttg cctaatttaa atgaggactt aacctgtgga aatattttga tgtgggaagc 60 
tgttactgtt aaaactgagg ttattggggt aactgctatg ttaaacttgc attcagggac 120 
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acaaaaaact catgaaaatg gtgctggaaa 
tgctgttggt ggggaacctt tggagctgca 
tcctgctcaa actgtaaccc caaaaaatgc 
ccacaaggct gttttggata aggataatgc 
5 aagtaaaaat gaaaacacta gatattttgg 
tgttttgcac attactaaca cagcaaccac 
cttgtgcaaa gctgacagct tgtatgtttc 
cacttctgga acacagcagt ggaagggact 
gcggtctgtg aaaaacccct acccaatttc 
10 gacacagagg gtggatgggc agcctatgat 
ggtttatgag gacacagagg agcttcctgg 
gtttggacaa accacaacta gaatgcagtg 
tgctattgct ttatttgtaa ccattataag 
cattcatttt atgtttcagg ttcaggggga 
15 cctctacaaa tgtggtatgg ctgattatga 
tccttattaa cccctttaca aattaaaaag 
ttaatagcag acactctatg cctgtgtgga 
ctgttatgcc tacttataaa ggttacagaa 
gcagcttttt cctttgtggt gtaaatagca 
20 gcatgactca aaaaacttag caattctgaa 
ttcttttttg gaggagtaga atgttgagag 
atttcttctg agcaaaacag gttttcctca 
tcagttccat aggttggaat ctaaaataca 
tatacactta aaaattttat atttacctta 
25 aattatgtca caccacagaa gtaaggttcc 
cgtgcctccc cactcctgca gttcgggggc 
ctggatgccg acggatttgc actgccggta 
gcagctgaac caactcgcga ggggatcgag 
atccccgcgc tggaggatca tccagccggc 
30 ttcatagaag gcggcggtgg aatcgaaatc 
ggtcatttcg atgaattcga gctcggtacc 
tcctcacgga ctcatcagag ttgctcaatt 
gtcaagaagg cgatagaagg cgatgcgctg 
gaggaagcgg tcagcccatt cgccgccaag 
35 tatgtcctga tagcggtccg ccacacccag 
gccattttcc accatgatat tcggcaagca 
gccgtcgggc atgcgcgcct tgagcctggc 
ctcttcgtcc agatcatcct gatcgacaag 
gatgcgatgt ttcgcttggt ggtcgaatgg 
40 ccgcattgca tcagccatga tggatacttt 
atcctgcccc ggcacttcgc ccaatagcag 
gagcacagct gcgcaaggaa cgcccgtcgt 
ctgcagttca ttcagggcac cggacaggtc 
cgctgacagc cggaacacgg cggcatcaga 
45 gccgaatagc ctctccaccc aagcggccgg 
catgcgaaac gatcctcatc ctgtctcttg 
ccttggcggc aagaaagcca tccagtttac 
cgccccagct ggcaattccg gttcgcttgc 
ccatgtaagc ccactgcaag ctacctgctt 
50 gatagcccag tagctgacat tcatccgggg 
cgtgttccgc ttcctttagc agcccttgcg 
ttgcaaaagc ctaggcctcc aaaaaagcct 
gaggcggcct cggcctctgc ataaataaaa 
gcggaactgg gcggagttag gggcgggatg 
55 gactaattga gatgcatgct ttgcatactt 
cacctggttg ctgactaatt gagatgcatg 
ggggactttc cacaccctaa ctgacacaca 
gacggtgaaa acctctgaca catgcagctc 
gatgccggga gcagacaagc ccgtcagggc 
60 gcagccatga cccagtcacg tagcgatagc 
cagagcagat tgtactgaga gtgcaccata 



acccattcaa gggtcaaatt ttcatttttt 180 
gggtgtgtta gcaaactaca ggaccaaata 240 
tacagttgac agtcagcaga tgaacactga 300 
ttatccagtg gagtgctggg ttcctgatcc 360 
aacctacaca ggtggggaaa atgtgcctcc 420 
agtgcttctt gatgagcagg gtgttgggcc 480 
tgctgttgac atttgtgggc tgtttaccaa 540 
tcccagatat tttaaaatta cccttagaaa 600 
ctttttgtta agtgacctaa ttaacaggag 660 
tggaatgtcc tctcaagtag aggaggttag 720 
ggatccagac atgataagat acattgatga 780 
aaaaaaatgc tttatttgtg aaatttgtga 840 
ctgcaataaa caagttaaca acaacaattg 900 
ggtgtgggag gttttttaaa gcaagtaaaa 960 
tctctagtca aggcactata catcaaatat 1020 
ctaaaggtac acaatttttg agcatagtta 1080 
gtaagaaaaa acagtatgtt atgattataa 1140 
tatttttcca taattttctt gtatagcagt 1200 
aagcaagcaa gagttctatt actaaacaca 1260 
ggaaagtcct tggggtcttc tacctttctc 1320 
tcagcagtag cctcatcatc actagatggc 1380 
ttaaaggcat tccaccactg ctcccattca 1440 
caaacaatta gaatcagtag tttaacacat 1500 
gagctttaaa tctctgtagg tagtttgtcc 1560 
ttcacaaaga tccgggacca aagcggccat 1620 
atggatgcgc ggatagccgc tgctggtttc 1680 
gaactccgcg aggtcgtcca gcctcaggca 174 0 
cccggggtgg gcgaagaact ccagcatgag 1800 
gtcccggaaa acgattccga agcccaacct 1860 
tcgtgatggc aggttgggcg tcgcttggtc 1920 
cggggatcct ctagaggcat ttcagtttcg 1980 
cgaaccccag agtcccgctc agaagaactc 204 0 
cgaatcggga gcggcgatac cgtaaagcac 2100 
ctcttcagca atatcacggg tagccaacgc 2160 
ccggccacag tcgatgaatc cagaaaagcg 2220 
ggcatcgcca tgggtcacga cgagatcctc 2280 
gaacagttcg gctggcgcga gcccctgatg 2340 
accggcttcc atccgagtac gtgctcgctc 2400 
gcaggtagcc ggatcaagcg tatgcagccg 24 60 
ctcggcagga gcaaggtgag atgacaggag 2520 
ccagtccctt cccgcttcag tgacaacgtc 2580 
ggccagccac gatagccgcg ctgcctcgtc 2640 
ggtcttgaca aaaagaaccg ggcgcccctg 2700 
gcagccgatt gtctgttgtg cccagtcata 2760 
agaacctgcg tgcaatccat cttgttcaat 2820 
atcagatctt gatcccctgc gccatcagat 2880 
tttgcagggc ttcccaacct taccagaggg 2940 
tgtccataaa accgcccagt ctagctatcg 3000 
tctctttgcg cttgcgtttt cccttgtcca 3060 
tcagcaccgt ttctgcggac tggctttcta 3120 
ccctgagtgc ttgcggcagc gtgaagcttt 3180 
cctcactact tctggaatag ctcagaggcc 3240 
aaaattagtc agccatgggg cggagaatgg 3300 
ggcggagtta ggggcgggac tatggttgct 3360 
ctgcctgctg gggagcctgg ggactttcca 3420 
ctttgcatac ttctgcctgc tggggagcct 3480 
ttccacagct gcctcgcgcg tttcggtgat 354 0 
ccggagacgg tcacagcttg tctgtaagcg 3600 
gcgtcagcgg gtgttggcgg gtgtcggggc 3660 
ggagtgtata ctggcttaac tatgcggcat 3720 
tgcggtgtga aataccgcac agatgcgtaa 3780 
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ggagaaaata ccgcatcagg cgctcttccg cttcctcgct cactgactcg ctgcgctcgg 3840 

tcgttcggct gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg ttatccacag 3900 

aatcagggga taacgcagga aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc 3960 

gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg cccccctgac gagcatcaca 4020 

5 aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg actataaaga taccaggcgt 4080 

ttccccctgg aagctccctc gtgcgctctc ctgttccgac cctgccgctt accggatacc 4140 

tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc 4200 

tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc 4260 

ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc caacccggta agacacgact 4 320 

10 tatcgccact ggcagcagcc actggtaaca ggattagcag agcgaggtat gtaggcggtg 4380 

ctacagagtt cttgaagtgg tggcctaact acggctacac tagaaggaca gtatttggta 4440 

tctgcgctct gctgaagcca gttaccttcg gaaaaagagt tggtagctct tgatccggca 4500 

aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa 4 5 60 

aaaaaggatc tcaagaagat cctttgatct tttctacggg gtctgacgct cagtggaacg 4 620 

15 aaaactcacg ttaagggatt ttggtcatga gattatcaaa aaggatcttc acctagatcc 4 680 

ttttaaatta aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa acttggtctg 4740 

acagttacca atgcttaatc agtgaggcac ctatctcagc gatctgtcta tttcgttcat 4800 

ccatagttgc ctgactcccc gtcgtgtaga taactacgat acgggagggc ttaccatctg 4860 

gccccagtgc tgcaatgata ccgcgagacc cacgctcacc ggctccagat ttatcagcaa 4 920 

20 taaaccagcc agccggaagg gccgagcgca gaagtggtcc tgcaacttta. tccgcctcca 4 980 

tccagtctat taattgttgc cgggaagcta gagtaagtag ttcgccagtt aatagtttgc 5040 

gcaacgttgt tgccattgct gcaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt 5100 

cattcagctc cggttcccaa cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa 5160 

aagcggttag ctccttcggt cctccgatcg ttgtcagaag taagttggcc gcagtgttat 5220 

25 cactcatggt tatggcagca ctgcataatt ctcttactgt catgccatcc gtaagatgct 5280 

tttctgtgac tggtgagtac tcaaccaagt cattctgaga atagtgtatg cggcgaccga 5340 

gttgctcttg cccggcgtca acacgggata ataccgcgcc acatagcaga actttaaaag 5400 

tgctcatcat tggaaaacgt tcttcggggc gaaaactctc aaggatctta ccgctgttga 54 60 

gatccagttc gatgtaaccc actcgtgcac ccaactgatc ttcagcatct tttactttca 5520 

30 ccagcgtttc tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg 5580 

cgacacggaa atgttgaata ctcatactct tcctttttca atattattga agcatttatc 5640 

agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat aaacaaatag 5700 

gggttccgcg cacatttccc cgaaaagtgc cacctgacgt ctaagaaacc attattatca 57 60 

tgacattaac ctataaaaat aggcgtatca cgaggccctt tcgtcttcaa g 5811 

35 



<210> 3 
<211> 2005 
<212> DNA 
40 <213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: vHM41 



45 <400> 3 

agtagaaaca agggtatttt tctttaccta gctaggcctg cgcgcaatta accctcacta 60 

aagggaacaa aagctggagc tccaccgcgg tggcggccgc tctagcatat gtttttcgtc 120 

tcagccaatc cctctagcat atgtttttcg tctcagccaa tccctctaga agcttcgtac 180 

gcatgcaagc tttaaatgca tatagtgcac cgcatgtttc cgttcttcac acatatgaaa 240 

50 acaaggccca ttgcaatggc aagaagcaaa aagcatgatg ccccgaagct aaaccaaagt 300 

atcacatctt tgtagccact actcaatttg actgggtcaa tttgtattct attttgcatc 360 

gcttcttctc tgtatttgct gtgatcataa gtattgttcc ttatactagc catacaatca 420 

tcgtcacatt tatgaaaaat ttcaaagcaa ccagtgccat cctcttcagc attttccctt 480 

aattgtttcc tcactcgctc atacagcttg ttcatctctg aatcagccaa atcaatagtg 54 0 

55 tgctggtttt ccattgccac aagaagttca gcattgtaag accatacttc tgtgatggag 600 

tctttggtcc agttaattaa attgccaatc tgcttttcca cttcagtgaa ttcattatct. 660 

attagctcaa attgctggtt ggttttctta atgagtctat ttaactttcc ggttatctga 720 

tcaattgccg attgggtgct tttgtagtct gctgcagttc cttctccttg tgcattctga 780 

tgcctgaaac cgtaccaccc gtcgaccaga ccttcccaac cattttcaat aaaccctgct 84 0 

60 atagcaccaa acaggcctct ttttttcctt tttttggaag gttcgggaac gttcttcatc 900 

ccagttgcca ataataaact ttcctgtttt acatatcttg ggcatttgcc aactgctctg 960 
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ctatttatgt tttgaaaagg caatctgctt gttatagtcc ctccactgtg gtagcattcc 1020 

ccttcgcaat tagcatcaac ctgcacatcg ctctggatcc ccatggactt tcccctcaag 1080 

aagctggcac gatttggagc tatgaaagcc ccattgaaac taaaagtaac tgtatcattg 1140 

ggatccaaga tcaaccaatg aaaatcaatc cgtccggacc ggccatttat ctgcggtcgt 1200 

5 gttcctggac tcggcacaaa agattgatga tatttggaac tcccgactgt tatcagttta 1260 

tttccactcc catatagttt ggtctgttcg gtggttgatc ctgaatggtg gattccccag 1320 

actatcagag ctgattctct ccctgtgttt ttgtatgatt ttgtcatttg tgggaaagaa 1380 

gcattgtctg tatttgacag gagccactcc atttctgcat agaatgaaga ccctgatctt 1440 

ctacatgcac tagttgttcc gttggtcctt attccactat atgtgaatcc cattgtttct 1500 

10 ttgtcaatcc cacctgatcc tctgaggatt tgtcgcaatg cctcttcatt aacaaacttc 1560 

cccgggtaac aaacatcatt tccttctcgt ctctcgatta ttagatcagc tgaaaattct 1620 

agaaattggt cgcattgagg tggtccggta atggtcccta acagtccgca ttggccaaga 1680 

tcagtggttc ttttcccttt tgagcaaatt ttggggatgt ttgtccgctc cactgtttcc 1740 

gttgcattga caacttctac tcctctctca gtgagtgtgt ttactttggt gccatttgat 1800 

15 acagcatgat gtccaagaca aattttgtct gcatttgtgg ggatgactgc cgcaagggcg 1860 

aaaaccagga tttgagtgtt cattttggta cctacaatat gtttttcgtc tcagccaatc 1920 

cctggtacaa tatgtttttc gtctcagcca atcctggtac gatgtggatg tcactcagtg 1980 

agtgattatc taccctgctt ttgct 2005 

20 

<210> 4 
<211> 1146 
<212> DNA 

<213> Artificial Sequence 

25 

<220> 

<223> Description of Artificial Sequence: vHM81 



<400> 4 

30 agtagaaaca agggtatttt tctttaccta gctaggcctg cgcgcaatta accctcacta 60 

aagggaacaa aagctggagc tccaccgcgg tggcggccgc tctagcatat gtttttcgtc 120 

tcagccaatc cctctagcat atgtttttcg tctcagccaa tccctctaga agcttcgtac 180 

gcatgcttaa ataagctgaa acgagaaagt tcttatctct tgctccactt caagcggtag 240 

ttgtaaggct tgcataaatg ttatttgttc aaaactattc tctgttatct tcaatctatg 300 

35 tctcacttct tcaattaacc atcttatttc ttcaaatttc tgactcaatt gttctcgcca 360 

ttttccgttt ctgctttgga gggagtggag gtcccccatt ctcattactg cttctccaag 420 

cgaatctctg tatagtttca gagactcgaa ctgtgttatc attccattca agtcctccga 480 

tgaggacccc aattgcattt ttgacatcct catcagtatg tcctggaaga gaaggcaatg 540 

gtgaaatttc gccgacaatt gctccctcat cggttaaagc ccttaatagt atgagagttt 600 

40 ccagccgatc gaaaatcaca ctgaagtttg ctttcagtat gatgttcttc cccatgatcg 660 

cctggtccat tctgatgcaa agggagcctg ccactttctg tttgggcatg agcatgaacc 720 

agtcccttga catctcttca agagtcatgt cagttaggta gcgtgtagca ggtacagagg 780 

caatggtcat tttaagtgcc tcatcggatt cgtcctccag aatccgctcc actatctgct 840 

ttccaacacg agtagctgtg tcgatgtcca gaccaagagt gctgcctctt cccctcaggg 900 

45 acttctgatc tcggcgaagt cggtcaagga atggggcatc acccatttct tggtctgcaa 960 

atcgtttgcg gacatgccaa agaaagcagt ctacctgaaa gcttgacaca gtgttggaat 1020 

ccattatggt acctacaata tgtttttcgt ctcagccaat ccctggtaca atatgttttt 1080 

cgtctcagcc aatcctggta cgatgtggat gtcactcagt gagtgattat ctaccctgct 1140 
tttgct ^ 114 6 

50 



<210> 5 
<211> 5860 
<212> DNA 
55 <213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: pAM424 



60 



<400> 5 

catcgattgg ctgactgatg agtccgtgag gacgaaacga aaaacatatt gtagagctcg 



60 
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aattcatcga aatgaccgac caagcgacgc 
ccgccgcctt ctatgaaagg ttgggcttcg 
tcctccagcg cggggatctc atgctggagt 
cgagttggtt cagctgctgc ctgaggctgg 
5 aatccgtcgg catccaggaa accagcagcg 
aggagtgggg aggcacgatg gccgctttgg 
tgtggtgtga cataattgga caaactacct 
aaatttttaa gtgtataatg tgttaaacta 
ccaacctatg gaactgatga atgggagcag 

10 ttttgctcag aagaaatgcc atctagtgat 
actcctccaa aaaagaagag aaaggtagaa 
agttttttga gtcatgctgt gtttagtaat 
acaaaggaaa aagctgcact gctatacaag 
ataagtaggc ataacagtta taatcataac 

15 agagtgtctg ctattaataa ctatgctcaa 
aaaggggtta ataaggaata tttgatgtat 
taccacattt gtagaggttt tacttgcttt 
gaaacataaa atgaatgcaa ttgttgttgt 
caaataaagc aatagcatca caaatttcac 

20 ttgtggtttg tccaaactca tcaatgtatc 
ctgtgtcctc ataaacccta acctcctcta 
catccaccct ctgtgtcctc ctgttaatta 
ggtttttcac agaccgcttt ctaagggtaa 
gctgtgttcc agaagtgttg gtaaacagcc 

25 tgtcagcttt gcacaagggc ccaacaccct 
tagtaatgtg caaaacagga ggcacatttt 
tgttttcatt tttacttgga tcaggaaccc 
ccaaaacagc cttgtggtca gtgttcatct 
ttacagtttg agcaggatat ttggtcctgt 

30 gttccccacc aacagcaaaa aaatgaaaat 
tttcatgagt tttttgtgtc cctgaatgca 
cagttttaac agtaacagct tcccacatca 
aattaggcaa aggaattctt gaagacgaaa 
taatgtcatg ataataatgg tttcttagac 

35 cggaacccct atttgtttat ttttctaaat 
ataaccctga taaatgcttc aataatattg 
ccgtgtcgcc cttattccct tttttgcggc 
aacgctggtg aaagtaaaag atgctgaaga 
actggatctc aacagcggta agatccttga 

40 gatgagcact tttaaagttc tgctatgtgg 
agagcaactc ggtcgccgca tacactattc 
cacagaaaag catcttacgg atggcatgac 
catgagtgat aacactgcgg ccaacttact 
aaccgctttt ttgcacaaca tgggggatca 

45 gctgaatgaa gccataccaa acgacgagcg 
aacgttgcgc aaactattaa ctggcgaact 
agactggatg gaggcggata aagttgcagg 
ctggtttatt gctgataaat ctggagccgg 
actggggcca gatggtaagc cctcccgtat 

50 aactatggat gaacgaaata gacagatcgc 
gtaactgtca gaccaagttt actcatatat 
atttaaaagg atctaggtga agatcctttt 
tgagttttcg ttccactgag cgtcagaccc 
tccttttttt ctgcgcgtaa tctgctgctt 

55 ggtttgtttg ccggatcaag agctaccaac 
agcgcagata ccaaatactg tccttctagt 
ctctgtagca ccgcctacat acctcgctct 
tggcgataag tcgtgtctta ccgggttgga 
gcggtcgggc tgaacggggg gttcgtgcac 

60 cgaactgaga tacctacagc gtgagctatg 
ggcggacagg tatccggtaa gcggcagggt 
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6 

ccaacctgcc atcacgagat ttcgattcca 120 
gaatcgtttt ccgggacgcc ggctggatga 180 
tcttcgccca ccccgggctc gatcccctcg 240 
acgacctcgc ggagttctac cggcagtgca 300 
gctatccgcg catccatgcc cccgaactgc 360 
tcccggatct ttgtgaagga accttacttc 420 
acagagattt aaagctctaa ggtaaatata 480 
ctgattctaa ttgtttgtgt attttagatt 540 
tggtggaatg cctttaatga ggaaaacctg 600 
gatgaggcta ctgctgactc tcaacattct 660 
gaccccaagg actttccttc agaattgcta 720 
agaactcttg cttgctttgc tatttacacc 780 
aaaattatgg aaaaatattc tgtaaccttt 840 
atactgtttt ttcttactcc acacaggcat 900 
aaattgtgta cctttagctt tttaatttgt 960 
agtgccttga ctagagatca taatcagcca 1020 
aaaaaacctc ccacacctcc ccctgaacct 1080 
taacttgttt attgcagctt ataatggtta 1140 
aaataaagca tttttttcac tgcattctag 1200 
ttatcatgtc tggatcccca ggaagctcct 1260 
cttgagagga cattccaatc ataggctgcc 1320 
ggtcacttaa caaaaaggaa attgggtagg 1380 
ttttaaaata tctgggaagt cccttccact 1440 
cacaaatgtc aacagcagaa acatacaagc 1500 
gctcatcaag aagcactgtg gttgctgtgt 1560 
ccccacctgt gtaggttcca aaatatctag 1620 
agcactccac tggataagca ttatccttat 1680 
gctgactgtc aactgtagca ttttttgggg 1740 
agtttgctaa cacaccctgc agctccaaag 1800 
ttgacccttg aatgggtttt ccagcaccat 18 60 
agtttaacat agcagttacc ccaataacct 1920 
aaatatttcc acaggttaag tcctcattta 1980 
gggcctcgtg atacgcctat ttttataggt 2040 
gtcaggtggc acttttcggg gaaatgtgcg 2100 
acattcaaat atgtatccgc tcatgagaca 2160 
aaaaaggaag agtatgagta ttcaacattt 2220 
attttgcctt cctgtttttg ctcacccaga 2280 
tcagttgggt gcacgagtgg gttacatcga 2340 
gagttttcgc cccgaagaac gttttccaat 2400 
cgcggtatta tcccgtgttg acgccgggca 24 60 
tcagaatgac ttggttgagt actcaccagt 2520 
agtaagagaa ttatgcagtg ctgccataac 2580 
tctgacaacg atcggaggac cgaaggagct 2640 
tgtaactcgc cttgatcgtt gggaaccgga 2700 
tgacaccacg atgcctgcag caatggcaac 2760 
acttactcta gcttcccggc aacaattaat 2820 
accacttctg cgctcggccc ttccggctgg 2880 
tgagcgtggg tctcgcggta tcattgcagc 2940 
cgtagttatc tacacgacgg ggagtcaggc 3000 
tgagataggt gcctcactga ttaagcattg 3060 
actttagatt gatttaaaac ttcattttta 3120 
tgataatctc atgaccaaaa tcccttaacg 3180 
cgtagaaaag atcaaaggat cttcttgaga 3240 
gcaaacaaaa aaaccaccgc taccagcggt 3300 
tctttttccg aaggtaactg gcttcagcag 3360 
gtagccgtag ttaggccacc acttcaagaa 3420 
gctaatcctg ttaccagtgg ctgctgccag 3480 
ctcaagacga tagttaccgg ataaggcgca 3540 
acagcccagc ttggagcgaa cgacctacac 3600 
agaaagcgcc acgcttcccg aagggagaaa 3660 
cggaacagga gagcgcacga gggagcttcc 3720 
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agggggaaac gcctggtatc tttatagtcc 
tcgatttttg tgatgctcgt caggggggcg 
ctttttacgg ttcctggcct tttgctggcc 
ccctgattct gtggataacc gtattaccgc 
5 ccgaacgacc gagcgcagcg agtcagtgag 
ttttctcctt acgcatctgt gcggtatttc 
ctgctctgat gccgcatagt taagccagta 
atggctgcgc cccgacaccc gccaacaccc 
ccggcatccg cttacagaca agctgtgacc 

10 tcaccgtcat caccgaaacg cgcgaggcag 
aagtccccag gctccccagc aggcagaagt 
accaggtgtg gaaagtcccc aggctcccca 
aattagtcag caaccatagt cccgccccta 
agttccgccc attctccgcc ccatggctga 

15 gccgcctcgg cctctgagct attccagaag 
ttttgcaaaa agcttcacgc tgccgcaagc 
ggaacacgta gaaagccagt ccgcagaaac 
gggctatctg gacaagggaa aacgcaagcg 
ttacatggcg atagctagac tgggcggttt 

20 ctggggcgcc ctctggtaag gttgggaagc 
cgccaaggat ctgatggcgc aggggatcaa 
ttcgcatgat tgaacaagat ggattgcacg 
tattcggcta tgactgggca caacagacaa 
tgtcagcgca ggggcgcccg gttctttttg 

25 aactgcagga cgaggcagcg cggctatcgt 
ctgtgctcga cgttgtcact gaagcgggaa 
ggcaggatct cctgtcatct caccttgctc 
caatgcggcg gctgcatacg cttgatccgg 
atcgcatcga gcgagcacgt actcggatgg 

30 acgaagagca tcaggggctc gcgccagccg 
ccgacggcga ggatctcgtc gtgacccatg 
aaaatggccg cttttctgga ttcatcgact 
aggacatagc gttggctacc cgtgatattg 
gcttcctcgt gctttacggt atcgccgctc 

35 ttcttgacga gttcttctga gcgggactct 
ctgatgagtc cgtgaggacg aaacgaaaaa 



tgtcgggttt cgccacctct gacttgagcg 3780 
gagcctatgg aaaaacgcca gcaacgcggc 3840 
ttttgctcac atgttctttc ctgcgttatc 3900 
ctttgagtga gctgataccg ctcgccgcag 3960 
cgaggaagcg gaagagcgcc tgatgcggta 4020 
acaccgcata tggtgcactc tcagtacaat 4080 
tacactccgc tatcgctacg tgactgggtc 4140 
gctgacgcgc cctgacgggc ttgtctgctc 4200 
gtctccggga gctgcatgtg tcagaggttt 4260 
ctgtggaatg tgtgtcagtt agggtgtgga 4320 
atgcaaagca tgcatctcaa ttagtcagca 4380 
gcaggcagaa gtatgcaaag catgcatctc 4440 
actccgccca tcccgcccct aactccgccc 4500 
ctaatttttt ttatttatgc agaggccgag 4560 
tagtgaggag gcttttttgg aggcctaggc 4620 
actcagggcg caagggctgc taaaggaagc 4680 
ggtgctgacc ccggatgaat gtcagctact 4740 
caaagagaaa gcaggtagct tgcagtgggc 4800 
tatggacagc aagcgaaccg gaattgccag 4860 
cctgcaaagt aaactggatg gctttcttgc 4920 
gatctgatca agagacagga tgaggatcgt 4980 
caggttctcc ggccgcttgg gtggagaggc 5040 
tcggctgctc tgatgccgcc gtgttccggc 5100 
tcaagaccga cctgtccggt gccctgaatg 5160 
ggctggccac gacgggcgtt ccttgcgcag 5220 
gggactggct gctattgggc gaagtgccgg 5280 
ctgccgagaa agtatccatc atggctgatg 5340 
ctacctgccc attcgaccac caagcgaaac 54 00 
aagccggtct tgtcgatcag gatgatctgg 54 60 
aactgttcgc caggctcaag gcgcgcatgc 5520 
gcgatgcctg cttgccgaat atcatggtgg 5580 
gtggccggct gggtgtggcg gaccgctatc 5640 
ctgaagagct tggcggcgaa tgggctgacc 5700 
ccgattcgca gcgcatcgcc ttctatcgcc 5760 
ggggttcgaa tcctaccagg gattggctga 5820 
catatggtac 5860 



40 



45 



50 



55 



60 



<210> 6 
<211> 4610 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 



PHL2507 



<400> 6 

gaggcatttc 

cttgcggcag 

aatggcacca 

acagtggagc 

ggccaatgcg 

ttttcagctg 

tttgttaatg 

acaatgggat 

tcagggtctt 

ttcccacaaa 

ggaatccacc 

ctgataacag 

ccgcagataa 

gatacagtta 



agtcagttgc 
tcatccccac 
aagtaaacac 
ggacaaacat 
gactgttagg 
atctaataat 
aagaggcatt 
tcacatatag 
cattctatgc 
tgacaaaatc 
attcaggatc 
tcgggagttc 
atggccggtc 
cttttagttt 



tcaaggtacc 
aaatgcagac 
actcactgag 
ccccaaaatt 
gaccattacc 
cgagagacga 
gcgacaaatc 
tggaataagg 
agaaatggag 
atacaaaaac 
aaccaccgaa 
caaatatcat 
cggacggatt 
caatggggct 



aaaatgaaca 
aaaatttgtc 
agaggagtag 
tgctcaaaag 
ggaccacctc 
gaaggaaatg 
ctcagaggat 
accaacggaa 
tggctcctgt 
acagggagag 
cagaccaaac 
caatcttttg 
gattttcatt 
ttcatagctc 



ctcaaatcct 
ttggacatca 
aagttgtcaa 
ggaaaagaac 
aatgcgacca 
atgtttgtta 
caggtgggat 
caactagtgc 
caaatacaga 
aatcagctct 
tatatgggag 
tgccgagtcc 
ggttgatctt 
caaatcgtgc 



ggttttcgcc 
tgctgtatca 
tgcaacggaa 
cactgatctt 
atttctagaa 
cccggggaag 
tgacaaagaa 
atgtagaaga 
caatgcttct 
gatagtctgg 
tggaaataaa 
aggaacacga 
ggatcccaat 
cagcttcttg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 
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aggggaaagt ccatggggat ccagagcgat 
tgctaccaca gtggagggac tataacaagc 
gcagttggca aatgcccaag atatgtaaaa 
aagaacgttc ccgaaccttc caaaaaaagg 
5 gggtttattg aaaatggttg ggaaggtctg 
aatgcacaag gagaaggaac tgcagcagac 
ataaccggaa agttaaatag actcattaag 
aatgaattca ctgaagtgga aaagcagatt 
atcacagaag tatggtctta caatgctgaa 
10 attgatttgg ctgattcaga gatgaacaag 
gaaaatgctg aagaggatgg cactggttgc 
tgtatggcta gtataaggaa caatacttat 
caaaatagaa tacaaattga cccagtcaaa 
tggtttagct tcggggcatc atgctttttg 
15 atatgtgtga agaacggaaa catgcggtgc 
ccccttgttt ctactccccc ccaacttcgg 
tttttttttt ttcccccgat gctggaggtc 
cccccccccc ggcgcggaac ggcggggcca 
ttttttgggg atcggccgct agcttctgtt 
20 tacagattaa atcagaacgc agaagcggtc 
gcgcggtggt cccacctgac cccatgccga 
gtagtgtggg gtctccccat gcgagagtag 
gctcagtcga aagactgggc ctttcgtttt 
agtaggacaa atccgccggg agcggatttg 
25 cgggcaggac gcccgccata aactgccagg 
gatggccttt ttgcgtttct acaaactctt 
gtatccgctc atgagacaat aaccctgata 
tatgagtatt caacatttcc gtgtcgccct 
tgtttttgct cacccagaaa cgctggtgaa 
30 acgagtgggt tacatcgaac tggatctcaa 
cgaagaacgt tttccaatga tgagcacttt 
ccgtgttgac gccgggcaag agcaactcgg 
ggttgagtac tcaccagtca cagaaaagca 
atgcagtgct gccataacca tgagtgataa 
35 cggaggaccg aaggagctaa ccgctttttt 
tgatcgttgg gaaccggagc tgaatgaagc 
gcctgtagca atggcaacaa cgttgcgcaa 
ttcccggcaa caattaatag actggatgga 
ctcggccctt ccggctggct ggtttattgc 
40 tcgcggtatc attgcagcac tggggccaga 
cacgacgggg agtcaggcaa ctatggatga 
ctcactgatt aagcattggt aactgtcaga 
tttaaaactt catttttaat ttaaaaggat 
gaccaaaatc ccttaacgtg agttttcgtt 
45 caaaggatct tcttgagatc ctttttttct 
accaccgcta ccagcggtgg tttgtttgcc 
ggtaactggc ttcagcagag cgcagatacc 
aggccaccac ttcaagaact ctgtagcacc 
accagtggct gctgccagtg gcgataagtc 
50 gttaccggat aaggcgcagc ggtcgggctg 
ggagcgaacg acctacaccg aactgagata 
gcttcccgaa gggagaaagg cggacaggta 
gcgcacgagg gagcttccag ggggaaacgc 
ccacctctga cttgagcgtc gatttttgtg 
55 aaacgccagc aacgcggcct ttttacggtt 
gttctttcct gcgttatccc ctgattcatt 
cccgtgcgga gtcggagagc gctccctgag 
ggccttcggt ccctcgtgtg tcccggtcgt 
cccgctctgg agacacgggc cggccccctg 
60 ccggcccggc gctgctcccg cgtgtgtcct 
tgtgtggctg cgatggtggc gtttttgggg 



gtgcaggttg atgctaattg cgaaggggaa 900 
agattgcctt ttcaaaacat aaatagcaga 960 
caggaaagtt tattattggc aactgggatg 1020 
aaaaaaagag gcctgtttgg tgctatagca 1080 
gtcgacgggt ggtacggttt caggcatcag 1140 
tacaaaagca cccaatcggc aattgatcag 1200 
aaaaccaacc agcaatttga gctaatagat 1260 
ggcaatttaa ttaactggac caaagactcc 1320 
cttcttgtgg caatggaaaa ccagcacact 1380 
ctgtatgagc gagtgaggaa acaattaagg 1440 
tttgaaattt ttcataaatg tgacgatgat 1500 
gatcacagca aatacagaga agaagcgatg 1560 
ttgagtagtg gctacaaaga tgtgatactt 1620 
cttcttgcca ttgcaatggg ccttgttttc 1680 
actatttgta tataggtttg gaaaaaaaca 1740 
aggtcgacca gtactccggg cgaaactttg 1800 
gaccagatgt ccgaaagtgt cccccccccc 18 60 
ctctggactc tttttttttt tttttttttt 1920 
ttggcggatg agagaagatt ttcagcctga 1980 
tgataaaaca gaatttgcct ggcggcagta 2040 
actcagaagt gaaacgccgt agcgccgatg 2100 
ggaactgcca ggcatcaaat aaaacgaaag 2160 
atctgttgtt tgtcggtgaa cgctctcctg 2220 
aacgttgcga agcaacggcc cggagggtgg 2280 
catcaaatta agcagaaggc catcctgacg 2340 
ttgtttattt ttctaaatac attcaaatat 2400 
aatgcttcaa taatattgaa aaaggaagag 24 60 
tattcccttt tttgcggcat tttgccttcc 2520 
agtaaaagat gctgaagatc agttgggtgc 2580 
cagcggtaag atccttgaga gttttcgccc 2640 
taaagttctg ctatgtggcg cggtattatc 2700 
tcgccgcata cactattctc agaatgactt 2760 
tcttacggat ggcatgacag taagagaatt 2820 
cactgcggcc aacttacttc tgacaacgat 2880 
gcacaacatg ggggatcatg taactcgcct 2940 
cataccaaac gacgagcgtg acaccacgat 3000 
actattaact ggcgaactac ttactctagc 3060 
ggcggataaa gttgcaggac cacttctgcg 3120 
tgataaatct ggagccggtg agcgtgggtc 3180 
tggtaagccc tcccgtatcg tagttatcta 324 0 
acgaaataga cagatcgctg agataggtgc 3300 
ccaagtttac tcatatatac tttagattga 3360 
ctaggtgaag atcctttttg ataatctcat 3420 
ccactgagcg tcagaccccg tagaaaagat 3480 
gcgcgtaatc tgctgcttgc aaacaaaaaa 3540 
ggatcaagag ctaccaactc tttttccgaa 3600 
aaatactgtc cttctagtgt agccgtagtt 3660 
gcctacatac ctcgctctgc taatcctgtt 3720 
gtgtcttacc gggttggact caagacgata 3780 
aacggggggt tcgtgcacac agcccagctt 3840 
cctacagcgt gagctatgag aaagcgccac 3900 
tccggtaagc ggcagggtcg gaacaggaga 3960 
ctggtatctt tatagtcctg tcgggtttcg 4020 
atgctcgtca ggggggcgga gcctatggaa 4080 
cctggccttt tgctggcctt ttgctcacat 4140 
aatgcaggtc acgatccttt ctggcgagtc 4200 
cgcgtgcggc ccgagaggtc gcgcctggcc 4260 
aggaggggcc ggccgaaaat gcttccggct 4320 
cgtgtggcac gggcggccgg gagggcgtcc 4380 
ggggttgacc agagggcccc gggcgctccg 4440 
acaggtgtcc gtgtccgtgt cgcgcgtcgc 4500 
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ctgggccggc ggcgtggtcg gtgacgcgac ctcccggccc cgggggaggt atatctttcg 4560 
ctccgagtcg gcattttggg ccgccgggtt attagtagaa acaggggtac 4 610 

5 <210> 7 

<211> 3558 
<212> DNA 

<213> Artificial Sequence 
10 <220> 

<223> Description of Artificial Sequence: pHL2583 
<400> 7 

tattagtaga aacagggtat tttttattct agtacattac gccccgccct gccactcatc 60 

15 gcagtactgt tgtaattcat taagcattct gccgacatgg aagccatcac agacggcatg 120 
atgaacctga atcgccagcg gcatcagcac cttgtcgcct tgcgtataat atttgcccat 180 
ggtgaaaacg ggggcgaaga agttgtccat attggccacg tttaaatcaa aactggtgaa 240 
actcacccag ggattggctg agacgaaaaa catattctca ataaaccctt tagggaaata 300 
ggccaggttt tcaccgtaac acgccacatc ttgcgaatat atgtgtagaa actgccggaa 360 

20 atcgtcgtgg tattcactcc agagcgatga aaacgtttca gtttgctcat ggaaaacggt 420 
gtaacaaggg tgaacactat cccatatcac cagctcaccg tctttcattg ccatacggaa 4 80 
ttccggatga gcattcatca ggcgggcaag aatgtgaata aaggccggat aaaacttgtg 540 
cttatttttc tttacggtct ttaaaaaggc cgtaatatcc agctgaacgg tctggttata 600 
ggtacattga gcaactgact gaaatgcctc aaaatgttct ttacgatgcc attgggatat 660 

25 atcaacggtg gtatatccag tgattttttt ctccatgatt atggccatta cccttgtttc 720 
tactcccccc caacttcgga ggtcgaccag tactccgggc gaaactttgt tttttttttt 780 
tcccccgatg ctggaggtcg accagatgtc cgaaagtgtc cccccccccc cccccccccg 840 
gcgcggaacg gcggggccac tctggactct tttttttttt tttttttttt tttttgggga 900 
tcggccgcta gcttctgttt tggcggatga gagaagattt tcagcctgat acagattaaa 960 

30 tcagaacgca gaagcggtct gataaaacag aatttgcctg gcggcagtag cgcggtggtc 1020 
ccacctgacc ccatgccgaa ctcagaagtg aaacgccgta gcgccgatgg tagtgtgggg 1080 
tctccccatg cgagagtagg gaactgccag gcatcaaata aaacgaaagg ctcagtcgaa 1140 
agactgggcc tttcgtttta tctgttgttt gtcggtgaac gctctcctga gtaggacaaa 1200 
tccgccggga gcggatttga acgttgcgaa gcaacggccc ggagggtggc gggcaggacg 1260 

35 cccgccataa actgccaggc atcaaattaa gcagaaggcc atcctgacgg atggcctttt 1320 
tgcgtttcta caaactcttt tgtttatttt tctaaataca ttcaaatatg tatccgctca 1380 
tgagacaata accctgataa atgcttcaat aatattgaaa aaggaagagt atgagtattc 1440 
aacatttccg tgtcgccctt attccctttt ttgcggcatt ttgccttcct gtttttgctc 1500 
acccagaaac gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt 1560 

40 acatcgaact ggatctcaac agcggtaaga tccttgagag ttttcgcccc gaagaacgtt 1620 
ttccaatgat gagcactttt aaagttctgc tatgtggcgc ggtattatcc cgtgttgacg 1680 
ccgggcaaga gcaactcggt cgccgcatac actattctca gaatgacttg gttgagtact 1740 
caccagtcac agaaaagcat cttacggatg gcatgacagt aagagaatta tgcagtgctg 1800 
ccataaccat gagtgataac actgcggcca acttacttct gacaacgatc ggaggaccga 18 60 

45 aggagctaac cgcttttttg cacaacatgg gggatcatgt aactcgcctt gatcgttggg 1920 
aaccggagct gaatgaagcc ataccaaacg acgagcgtga caccacgatg cctgtagcaa 1980 
tggcaacaac gttgcgcaaa ctattaactg gcgaactact tactctagct tcccggcaac 2040 
aattaataga ctggatggag gcggataaag ttgcaggacc acttctgcgc tcggcccttc 2100 
cggctggctg gtttattgct gataaatctg gagccggtga gcgtgggtct cgcggtatca 2160 

50 ttgcagcact ggggccagat ggtaagccct cccgtatcgt agttatctac acgacgggga 2220 
gtcaggcaac tatggatgaa cgaaatagac agatcgctga gataggtgcc tcactgatta 2280 
agcattggta actgtcagac caagtttact catatatact ttagattgat ttaaaacttc 2340 
atttttaatt taaaaggatc taggtgaaga tcctttttga taatctcatg accaaaatcc 2400 
cttaacgtga gttttcgttc cactgagcgt cagaccccgt agaaaagatc aaaggatctt 24 60 

55 cttgagatcc tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac 2520 
cagcggtggt ttgtttgccg gatcaagagc taccaactct ttttccgaag gtaactggct 2580 
tcagcagagc gcagatacca aatactgtcc ttctagtgta gccgtagtta ggccaccact 2640 
tcaagaactc tgtagcaccg cctacatacc tcgctctgct aatcctgtta ccagtggctg 2700 
ctgccagtgg cgataagtcg tgtcttaccg ggttggactc aagacgatag ttaccggata 27 60 

60 aggcgcagcg gtcgggctga acggggggtt cgtgcacaca gcccagcttg gagcgaacga 2820 
cctacaccga actgagatac ctacagcgtg agctatgaga aagcgccacg cttcccgaag 2880 
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ggagaaaggc 
agcttccagg 
ttgagcgtcg 
acgcggcctt 
cgttatcccc 
tcggagagcg 
cctcgtgtgt 
gacacgggcc 
ctgctcccgc 
gatggtggcg 
gcgtggtcgg 
cattttgggc 



ggacaggtat 
gggaaacgcc 
atttttgtga 
tttacggttc 
tgattcatta 
ctccctgagc 
cccggtcgta 
ggccccctgc 
gtgtgtcctg 
tttttgggga 
tgacgcgacc 
cgccgggt 



ccggtaagcg 
tggtatcttt 
tgctcgtcag 
ctggcctttt 
atgcaggtca 
gcgtgcggcc 
ggaggggccg 
gtgtggcacg 
gggttgacca 
caggtgtccg 
tcccggcccc 



gcagggtcgg 
atagtcctgt 
gggggcggag 
gctggccttt 
cgatcctttc 
cgagaggtcg 
gccgaaaatg 
ggcggccggg 
gagggccccg 
tgtccgtgtc 
gggggaggta 



aacaggagag 
cgggtttcgc 
cctatggaaa 
tgctcacatg 
tggcgagtcc 
cgcctggccg 
cttccggctc 
agggcgtccc 
ggcgctccgt 
gcgcgtcgcc 
tatctttcgc 



cgcacgaggg 
cacctctgac 
aacgccagca 
ttctttcctg 
ccgtgcggag 
gccttcggtc 
ccgctctgga 
cggcccggcg 
gtgtggctgc 
tgggccggcg 
tccgagtcgg 



2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3558 



<210> 8 
<211> 4343 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: pHL2989 
<400> 8 

ctttctggcg agtccccgtg cggagtcgga gagcgctccc tgagcgcgtg cggcccgaga 60 
ggtcgcgcct ggccggcctt cggtccctcg tgtgtcccgg tcgtaggagg ggccggccga 120 
aaatgcttcc ggctcccgct ctggagacac gggccggccc cctgcgtgtg gcacgggcgg 180 
ccgggagggc gtccccggcc cggcgctgct cccgcgtgtg tcctggggtt gaccagaggg 240 
ccccgggcgc tccgtgtgtg gctgcgatgg tggcgttttt ggggacaggt gtccgtgtcc 300 
gtgtcgcgcg tcgcctgggc cggcggcgtg gtcggtgacg cgacctcccg gccccggggg 360 
aggtatatct ttcgctccga gtcggcattt tgggccgccg ggttattagt agaaacaggg 420 
tattttttat actagtaagc tcgaaggagt ccaccatgag taaaggagaa gaacttttca 4 80 
ctggagttgt cccaattctt gttgaattag atggtgatgt taatgggcac aaattttctg 540 
tcagtggaga gggtgaaggt gatgcaacat acggaaaact tacccttaaa tttatttgca 600 
ctactggaaa actacctgtt ccatggccaa cacttgtcac tactttcact tatggtgttc 660 
aatgcttttc aagataccca gatcatatga aacagcatga ctttttcaag agtgccatgc 720 
ccgaaggtta tgtacaggaa agaactatat ttttcaaaga tgacgggaac tacaagacac 780 
gtgctgaagt caagtttgaa ggtgataccc ttgttaatag aatcgagtta aaaggtattg 840 
attttaaaga agatggaaac attcttggac acaaattgga atacaactat aactcacaca 900 
atgtatacat catggctgac aagcagaaga acggaatcaa ggccaacttc aagacccgcc 960 
acaacatcga ggacggcggc gtgcagctgg ccgaccacta ccagcagaac accccaattg 1020 
gcgatggccc tgtcctttta ccagacaacc attacctgtc cacacaatct gccctttcga 1080 
aagatcccaa cgaaaagaga gaccacatgg tccttcttga gtttgtaaca gctgctggga 1140 
ttacacatgg catggatgaa ctatacaagg gatcccatca ccatcaccat cactaagctc 1200 
catggtctag atatctagta cattacgccc cgccctgcca ctcatcgcag tactgttgta 1260 
attcattaag cattctgccg acatggaagc catcacagac ggcatgatga acctgaatcg 1320 
ccagcggcat cagcaccttg tcgccttgcg tataatattt gcccatggtg aaaacggggg 1380 
cgaagaagtt gtccatattg gccacgttta aatcaaaact ggtgaaactc acccagggat 1440 
tggcactcac aaagaacatg ttctcgatga atcctttagg gaagtaggcc aggttttcac 1500 
cgtaacacgc cacatcttgc gaatatatgt gtagaaactg ccggaaatcg tcgtggtatt 1560 
cactccagag cgatgaaaac gtttcagttt gctcatggaa aacggtgtaa caagggtgaa 1620 
cactatccca tatcaccagc tcaccgtctt tcattgccat acggaattcc ggatgagcat 1680 
tcatcaggcg ggcaagaatg tgaataaagg ccggataaaa cttgtgctta tttttcttta 1740 
cggtctttaa aaaggccgta atatccagct gaacggtctg gttataggta cattgagcaa 1800 
ctgactgaaa tgcctcaaaa tgttctttac gatgccattg ggatatatca acggtggtat 1860 
atccagtgat ttttttctcc atgattatgc aaaaaatacc cttgtttcta ctccccccca 1920 
acttcggagg tcgaccagta ctccgggcga aactttgttt tttttttttc ccccgatgct 1980 
ggaggtcgac cagatgtccg aaagtgtccc cccccccccc cccccccggc gcggaacggc 2040 
ggggccactc tggactcttt tttttttttt tttttttttt tttggggatc ggccgctagc 2100 
ttctgttttg gcggatgaga gaagattttc agcctgatac agattaaatc agaacgcaga 2160 
agcggtctga taaaacagaa tttgcctggc ggcagtagcg cggtggtccc acctgacccc 2220 
atgccgaact cagaagtgaa acgccgtagc gccgatggta gtgtggggtc tccccatgcg 2280 



WO 00/53786 PCT/EP00/01903 

11 

agagtaggga actgccaggc atcaaataaa acgaaaggct cagtcgaaag actgggcctt 2340 
tcgttttatc tgttgtttgt cggtgaacgc tctcctgagt aggacaaatc cgccgggagc 2400 
ggatttgaac gttgcgaagc aacggcccgg agggtggcgg gcaggacgcc cgccataaac 2460 
tgccaggcat caaattaagc agaaggccat cctgacggat ggcctttttg cgtttctaca 2520 
5 aactcttttg tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac 2580 
cctgataaat gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg 2640 
tcgcccttat tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc 2700 
tggtgaaagt aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg 27 60 
atctcaacag cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga 2820 

10 gcacttttaa agttctgcta tgtggcgcgg tattatcccg tgttgacgcc gggcaagagc 2880 
aactcggtcg ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag 2940 
aaaagcatct tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga 3000 
gtgataacac tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg 3060 
cttttttgca caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga 3120 

15 atgaagccat accaaacgac gagcgtgaca ccacgatgcc tgtagcaatg gcaacaacgt 3180 
tgcgcaaact attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact 3240 
ggatggaggc ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt 3300 
ttattgctga taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg 3360 
ggccagatgg taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta 3420 

20 tggatgaacg aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac 3480 
tgtcagacca agtttactca tatatacttt agattgattt aaaacttcat ttttaattta 3540 
aaaggatcta ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt 3 600 
tttcgttcca ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt 3660 
tttttctgcg cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt 3720 

25 gtttgccgga tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc 3780 
agataccaaa tactgtcctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg 3840 
tagcaccgcc tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg 3900 
ataagtcgtg tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt 3960 
cgggctgaac ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac 4020 

30 tgagatacct acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg 4080 
acaggtatcc ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg 4140 
gaaacgcctg gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat 4200 
ttttgtgatg ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac gcggcctttt 4260 
tacggttcct ggccttttgc tggccttttg ctcacatgtt ctttcctgcg ttatcccctg 4320 

35 attcattaat gcaggtcacg ate 4343 



<210> 9 
<211> 3888 
40 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: pHLl920 

45 

<400> 9 

cccaaaaaaa aaaaaaaaaa aaaaaaaaag agtccagagt ggccccgccg ttccgcgccg 60 
gggggggggg ggggggggga cactttegga catctggtcg acctccagca tegggggaaa 120 
aaaaaaaaac aaagtttcgc ceggagtact ggtcgacctc cgaagttggg ggggagtaga 180 

50 aacagggtag ataatcactc actgagtgac atccacatcg cgagcgcgcg taatacgact 240 
cactataggg cgaattgggt accgggcccc ccctcgaggt egaeggtate gataagcttc 300 
gacgagattt tcaggagcta aggaagctaa aatggagaaa aaaatcactg gatataccac 360 
cgttgatata tcccaatggc ategtaaaga acattttgag gcatttcagt cagttgetea 420 
atgtacctat aaccagaccg ttcagctgga tattaeggee tttttaaaga ccgtaaagaa 480 

55 aaataagcac aagttttatc eggectttat tcacattctt gcccgcctga tgaatgetea 540 
teeggaatte cgtatggcaa tgaaagaegg tgagctggtg atatgggata gtgttcaccc 600 
ttgttacacc gttttccatg agcaaactga aacgttttca tcgctctgga gtgaatacca 660 
cgacgatttc eggcagttte tacacatata ttegcaagat gtggcgtgtt acggtgaaaa 720 
cctggcctat ttccctaaag ggtttattga gaatatgttt ttcgtctcag ccaatccctg 780 

60 ggtgagtttc accagttttg atttaaacgt ggccaatatg gacaacttct tcgcccccgt 840 
tttcaccatg ggcaaatatt ataegcaagg cgacaaggtg ctgatgccgc tggegattea 900 
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ggttcatcat gccgtttgtg atggcttcca tgtcggcaga atgcttaatg aattacaaca 960 

gtactgcgat gagtggcagg gcggggcgta atttttttaa ggcagttatt ggtgccctta 1020 

aacgcctggt gctacgcctg aataagtgat aataagcgga tgaatggcag aaattcgtcg 1080 

aagcttgata tcgaattcct gcagcccggg ggatccacta gttctagagc ggccgccacc 1140 

5 gcggtggagc tccagctttt gttcccttta gtgagggtta attgcgcgca ggcctagcta 1200 

ggtaaagaaa aatacccttg attcttctaa taacccggcg gcccaaaatg ccgactcgga 12 60 

gcgaaagata tacctccccc ggggccggga ggtcgcgtca ccgaccacgc cgccggccca 1320 

ggcgacgcgc gacacggaca cctgtcccca aaaacgccac catcgcagcc acacacggag 1380 

cgcccggggc cctctggtca accccaggac acacgcggga gcagcgccgg gccggggacg 1440 

10 ccctcccggc cgcccgtgcc acacgcaggg ggccggcccg tgtctccaga gcgggagccg 1500 

gaagcatttt cggccggccc ctcctacgac cgggacacac gagggaccga aggccggcca 1560 

ggcgcgacct ctcgggccgc acgcgcgctc agggagcgct ctccgactcc gcacggggac 1620 

tcgccagaaa ggatcgtgac ctgcattaat gaatcagggg ataacgcagg aaagaacatg 1680 

tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc 1740 

15 cataggctcc gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga 1800 

aacccgacag gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct 1860 

cctgttccga ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg 1920 

gcgctttctc atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag 1980 

ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat 2040 

20 cgtcttgagt ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac 2100 

aggattagca gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac 2160 

tacggctaca ctagaaggac agtatttggt atctgcgctc tgctgaagcc agttaccttc 2220 

ggaaaaagag ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt 2280 

tttgtttgca agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc 2340 

25 ttttctacgg ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg 2400 

agattatcaa aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca 2460 

atctaaagta tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca 2520 

cctatctcag cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag 2580 

ataactacga tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac 2640 

30 ccacgctcac cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc 2700 

agaagtggtc ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct 2760 

agagtaagta gttcgccagt taatagtttg cgcaacgttg ttgccattgc tacaggcatc 2820 

gtggtgtcac gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg 2880 

cgagttacat gatcccccat gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc 2 940 

35 gttgtcagaa gtaagttggc cgcagtgtta tcactcatgg ttatggcagc actgcataat 3000 

tctcttactg tcatgccatc cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag 3060 

tcattctgag aatagtgtat gcggcgaccg agttgctctt gcccggcgtc aacacgggat 3120 

aataccgcgc cacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg 3180 

cgaaaactct caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca 3240 

40 cccaactgat cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga 3300 

aggcaaaatg ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat actcatactc 3360 

ttcctttttc aatattattg aagcatttat cagggttatt gtctcatgag cggatacata 3420 

tttgaatgta tttagaaaaa taaacaaaag agtttgtaga aacgcaaaaa ggccatccgt 34 80 

caggatggcc ttctgcttaa tttgatgcct ggcagtttat ggcgggcgtc ctgcccgcca 3540 

45 ccctccgggc cgttgcttcg caacgttcaa atccgctccc ggcggatttg tcctactcag 3600 

gagagcgttc accgacaaac aacagataaa acgaaaggcc cagtctttcg actgagcctt 3660 

tcgttttatt tgatgcctgg cagttcccta ctctcgcatg gggagacccc acactaccat 3720 

cggcgctacg gcgtttcact tctgagttcg gcatggggtc aggtgggacc accgcgctac 3780 

tgccgccagg caaattctgt tttatcagac cgcttctgcg ttctgattta atctgtatca 3840 

50 ggctgaaaat cttctctcat ccgccaaaac agaagctagc ggccgatc 3888 
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